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“The truth is, the Science of Nature has been already too long made only 
a work of the Brain and the Fancy : It is now high time that it should return 
xofthe plainness and soundness of Observations on material and obvious 
things.” Robert Hooke, Micrographia, 1665, Preface. 


“The present tendency is to loaf and to generalize... We need men not 
afraid to work : who will get busy the adding machine and the logarithms and 
give us exact studies, such as we get from the psychological laboratories. So- 
ciology can be made an exact, quantitative science if we can get industrious 
men interested in it.” F. H. Giddings, Amery Jour. Sociol., 15:196, 1909-10. 


“The critical'habit of thought, if usual in a society, will pervade all its 
mores, because it is a way of taking up the problems of life. Men educated 
in it cannot be stampeded by stump orators and are never deceived by dithy- 
rambic oratory. They are slow to believe. . .” W. G. Sumner, Folkways, 
1907, pp. 632-33. 


“The fundamentals of the science which our century inherited from its 
predecessors have been modified, now slightly, now quite perceptibly, till our 
outlook on the physical universe today bears but little resemblance to that 
of only thixty years ago. Great and striking as these advances are, there has 
been another, most rapidly developed since 1930, which has been slowly gath- 
ering momentum for all of 2300 years, which is of far deeper significance for 
“truth” — or Pilate’s query — than any of the radical advances of science of the 
past thirty years. Being more fundamental, more radical, and simpler than 
any of the spectacular advances in science, naturally this new advance has 
esca) the notice which its far-reachi g importance merits. Yet it is of 
profound significance for all theorizing and truth-seeking, scientific or other.” 
—E. T. Bell, The Search for Truth eynal and Hitchcock, 1934), pp. 7-8. 
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“Some of the gratuitous dualisms done ayamih [by modern biology, cul- 
tural anthropology, and the experimental met od] . . . are those of the objec- 
tive and subjective, the real and apparent, the mental and physical, scientific 
physical objects and objects of perception, things of experience and things-in- 
themselves concealed behind experience, the latter being an impenetrable veil 
which prevents cognitive access to the things of nature.” — John Dewey, “Ex- 
erience, Knowledge, and Value: A Rejoinder.” (The Philosophy of John 
ewey, P, A. Schilpp, Editor, Northwestern University Press, 1939), P- 524. 


PREFACE 


TO THE FIRST EDITION S 


The advance of physical science has been chiefly dependent 
upon the refinement of the technique and tools of observa- 
` tion and description. In the social sciences this technique 
and these tools are as yet largely undeveloped: There is a 
large number of treatises on “Principles” of Sociology, Eco- 
nomics, and Political Science. As hypotheses, these Princi- 
ples are of unquestionable value. But the hope for the ad- 
vancement of the social sciences lies in the testing of these 
principles by a patient accumulation of observed data, scien- 
tifically classified and generalized. The technique of such 
observation and classification is today the aspect of the social 
studies which is most in need of development. Á, y 

To be sure, the details of an adequate technology and 
methodology in a given case can be developed only as we 
work on specified problems. “The thought of an end,” says 
John Dewey, “is strictly conrêlative to perception of means 
and methods. Only when and as the latter become clear 
during the serial proċess of execution, does the project and 
guiding aim and plan Become evident and articulated.” It 
would certainly be absurd to attempt to prescribe a rigid 
method of procedure to be followed in all inquiries. How- 
ever, there are certain principles of logic, certain common 
dangers of bias, and certain common general approaches, 
which may be treated as generally present in all research. 
It is with these general principles and methods in the social 
sciences that this book is concerned. wi 

The present work has been produced primarily in order 
to bring together and to make available in compact form ma-, 
terials which the author has found useful in his course on 
Methods of Social Research. The objectives of this course, 
and hence of the present book, are: à 

1. To emphasize the importance of accurate and objective 
observation as the first step in scientific method, on which 
the value of subsequent analysis and the validity of the con- 
clusions depend. y 
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2. To inculcate a healthy skepticism of, and a critical spirit 
toward, statistical data by acquainting the student with the 
difficulties, danger, and inaccuracies to which the collection 
and interpretation of social data are especially subject. 

3. To give a general knowledge of the technique of gather- 
ing original data. 

The value of the first two of these objectives to the general 
student will probably be admitted by everyone. With the 
increasing use of quantitative data in the literature and so- 
cial administration of the day, a critical discrimination in 
their production and consumption becomes increasingly de- 
sirable. The last objective, in addition to being a goal in it- 
self to the research student and the field worker, is a means 
to the achievement of the first two purposes. 

The need for a discussion of the topics treated in this book 
was impresséd on the author by the fact that while there are 
several dozens of treatises on statistics and methods of classi- 
fying, generalizing, and scientifically manipulating quantita- 
tive social data, there are only three or four which place the 
main emphasis on the scientific observation and recording of 
these data. The main reason for this fact is, of course, that 
the methods of handling and manipulating data are largely 
the same in all sciences, and hence it is natural that this 
phase of scientific method should be much’ more highly de- 
veloped in the social sciences than the first step, namely, the 
observation and recording of data. This is the step in scien- 
tific method which is likely to be most different in the vari- 
ous fields of science, because it is the step concerned largely 
with the matter of units, nomenclature, and instruments of 
observation which may be very different in different fields. 
As a result, statistical technique of manipulating data already 
collected inthe social sciences has become more highly de- 
we than the technique of observing and recording the 

ta. 

Obviously there is little to be gained by the application of 
refined statistical technique to data which are grossly inac- 
curate, indefinite, or otherwise defective from the standpoint 
of observation and recording. Says Wilcox: “I cannot but 
feel that there may be some danger of turning the attention 
of the class too exclusively either to graphic methods of rep- 
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resenting results or to mathematical methods of determin- 
ing the exact degree of correlation. Doubtless each of these 
has an important place in a course in statistics, but neither 
strikes at the spot in which most errors are found. . . For a 
large majority of statistical errors are those which creep into 
the raw material and cannot be eliminated by any refine- 
ments of mathematical analysis or any skill in graphic repre- 
sentation. To center attention on these field errors has the 
incidental advantage of bringing home to the student ‘as 
nothing else can the real difficulties before the official statisti- 
cian. ..”1 Certainly it is this phase of social science which is 
in greatest need of development today. To call attention to 
this need, as well as to make a few general suggestions toward 
meeting it, is the purpose of the present work. 

I have tried to give full credit in the text and in foot- 
notes to the various authors on whose works Ihave drawn, 
but realize that it is’ perhaps impossible to make adequate 
acknowledgments in this way. Especially is this true of the 
ideas I have drawn from my former teachers. First among 
these should be mentioned Professor L. L. Bernard, to whom 
more than to any other person I am indebted for my general 
viewpoint as well as for numerous quotations in the text. 
To Professor F. Stuart Chapin+I owe a threefold debt — for 
training in quantitative methods, for permission to draw lib- 
erally on his published works, and for reading a large por- 
tion of the manuscript and making valuable suggestions. To 
Professor M. C. Elmer I am likewise indebted for his reading 
of a large portion of the manuscript and for constant en- 
couragement and suggestions. I am also under obligation 
to Professor R. E. Chaddock for valuable suggestions, en- 
couragement, and criticism of part of the manuscript. 

In addition to the above I wish to express my appreciation 
to Professors L. L. Thurstone and Jerome Davis, and to Dr. 
Emil Frankel for permission to quote liberally from their 
published writings, and to Dorothy Swaine Thomas for the 
privilege of examining a portion of the manuscript of “The 
Child in America” by D. S. and W. I. Thomas. Acknowl- 

1W. F. Wilcox, “Cooperation between Academic and Official Statisticians,” 
Publications of the American Statistical Association, Vol. 14 (N. S), 1914715» 
Pp- 292-93- 
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edgments are also due to the ane t Journal of Sociology, 
The Century Company, and Social Forces, for permission to 
reprint extracts from their publications. 
inally, I am under great obligation to Professor John F. 
Markey for reading the major part of the manuscript and 
making suggestions of great importance ; to my colleague Cal- 
vin F. Schmid for a discriminating reading of the proofs ; and 
above all to Professor Read Bain, who has not only read the 
entire manuscript critically but has given unsparingly of his 
time in extensive criticism, much discussion, and constant en- 
couragement. For the errors and shortcomings which re- 
main, the author is, of course, solely responsible. 
GEORGE A. LUNDBERG 
Pittsburgh, Pa. 
July 1, 1929 
As 


-t 
PREFACE 


TO THE SECOND EDITION 


The extensive developments in methods of social research 
during the past decade have necessitated a complete rewrit- 
ing of all but four chapters of the First Edition. Extensive 
additions have been made even to those chapters of the First 
Edition which have been in part retained (Chapters II-V). 
Occasional paragraphs and pages of other chapters of the 
original edition have also been incorporated into the present 
text, but nearly all of these passages have been modified to 
some degree. The ‘orginal chapter headings have been re- 
tained for the most part, with some changes in their order. 
The chapter on Case Work and the Statistical Method of 
the First Edition has been eliminated as a separate chapter. 
The aspects of this subject which are of contemporary inter- 
est are discussed in Chapters I and XI of the present text. 
Two new chapters have been added, namely, Chapter VII on 
Questionnaires and Chapter X on Sociometric Methods in 
Ecology and Inter-personal Relations. At least three-fourths 
of the present edition is entirely new, both as regards the 
text and the research referred to. Most of the references are 
to studies that have appeared during the last five years. 

The present text places greater emphasis than did the First 
Edition on practical problems of research, and devotes rela- 
tively less space to controversial theoretical topics. Two con- 
ditions have influenced me in this modification. First, I 
have devoted considerable space in another recent volume to 
the more theoretical questions of methodology. Instead of 
going over this ground again in the present volume, the posi- 
tion in question has been briefly summarized, and references 
made to the more complete discussion to be found in my 
Foundations of Sociology (Macmillan, 1939). 

Second, my experience in practical non-academic research 
since the first edition appeared, has doubtless influenced 
both the selection of subject matter and its treatment in the 
present edition. For example, my research opportunities 
as Director of the Bureau of Social Research of the Pitts- 
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burgh Federation of Social Agencies, as director of a research 
project on leisure and recreation for the Council for Re- 
search in the Social Sciences of Columbia University, and 
as Research Supervisor in the Federal Emergency Relief Ad- 
ministration, impressed me with certain aspects of research 
which were relatively neglected in the First Edition. My 
recent community studies on interpersonal relations, like- 
wise, have caused me to deal in the present edition with 
some of the problems and techniques of this type of research. 
In any case, the present text is primarily a textbook on prac- 
tical techniques of securing basic data on sociological ques- 
tions. It aims to make available to students and teachers the 
most valuable recent experience and criticism on this sub- 
ject. Both for thege reasons, and because of the general 
trend of research during the past decade, the reader will 
probably find much less material of a controversial nature 
in the present edition, and very little with which anyone will 
disagree with reference to practical advice to researchers, re- 
gardless of his individual views on a number of questions of 
sociological theory. 

At the same time, the breadth of the field andthe variety 
of the methods prevent giving equally full treatment to all 
of them. This is not necessarily a reflection on the value of 
the neglected methods. In view of the fact that this point 
is frequently overlooked, it may not be amiss to repeat what 
I said in the First Edition on this subject : 


The social sciences cannot afford, in their present stage of de- 
velopment, at least, to neglect any method which promises to 
throw light on the problems which we seek to solve. The argu- 
ment in this book is for the refinement and objectification of all 
the methods which at present admittedly contribute something 
to our understanding of social behavior. There is, for example, 
no desire to discredit or abandon the fullest possible use of life 
history documents, intensive historical studies of individual cases 
or institutions, and other discursive and non-quantitative ap- 
proaches. What has been advocated is that the technique of 
classifying and analyzing such information be improved in such 
a way as to permit scientific generalizations from the data. This 
involves the development of a technique for the statistical sum- 
marization of these case studies. These methods tend to merge 
with the procedures we have considered, to the extent that they 
become objective in their technique. Instead of regarding life 
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history and case study methods as fundamentally different from 
the tests and statistical techniques, therefore, they should be 
regarded as different stages in the development of a common 
scientific method. The firal test of the validity of a method 
is its efficacy in securing the type of results sought. It follows that 
we are justified in the use of any method, regardless of its degree 
of objectivity, if it helps to throw even a little light on the prob- 
lem we are seeking to solve. From this point of view most social 
problems call for a composite method — the full utilization of 
historical and genetic sources, the non-quantitative description of 
individual cases in complete social situations, as well as the more 
rigid, objective, and quantitative tests —all of which may give us 
a more accurate account than any single approach can give. The 

urpose in pointing out the weaknesses of certain methods, there- 
ore, is not to advocate their abandonment as long as they con- 
tribute something to our knowledge. Theypurpose of evaluating 
them has been to help us avoid extravagant assumptions as to the 
validity of the pae which they are capable of yielding. 
(First Edition, pp. 241-42.) 2% 

For the general bibliography and appendices of the First 
Edition I have substituted at the end of each chapter spe- 
cific suggestions for further study of the subject under con- 
sideration. The enormous increase during the past decade 
of literature on the subjects discussed in the present book 
makes it necessary for the student in the future to rely on 
annotated bibliographies indicating which studies are likely 
to contain material relevant to a particular inquiry. On the 
subject of attitude research alone, for example, there ap- 
peared in periodicals in English, during the years 1937-39 
inclusive, some two hundred titles, not counting relatively 
inaccessible theses and papers and excluding all studies hav- 
ing no bearing on methodology. Annotated bibliographies 
for the field have, fortunately, become increasingly available, 
enabling students to go more directly to the relevant sources. 
A bibliography of such bibliographies, aggregating many 
thousands of titles, is appended to the present volume. _ 

My first obligation to be acknowledged in connection with 
the preparation of the present text is to the users of the First 
Edition. Their criticisms and suggestions are chiefly respon- 
sible for the improvements which will be found in the pres- 
ent volume. Among the many who have made helpful sug- 
gestions in this way are Raymond F. Sletto of the University 
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of Minnesota, Paul F. Lazarsfeld of Columbia University, 
Stuart C. Dodd of American University of Beirut, Ruth In- 
glis of the Research Division of the Curtis Publishing Com- 
pany, J. E. Bachelder of the University of New Hampshire, 
and Calvin F. Schmid of the University of Washington. Iam 
indebted to the first three of these persons and'to F. F. 
Stephan of Cornell University for permission to quote lib- 
erally from their published works. Mr. Sletto and Miss In- 
glis have also read parts of the manuscript and made valuable 
suggestions. Norman E. Himes, in his capacity as editor of 
Longmans Sociology Series, has greatly improved the form of 
the final manuscript by numerous valuable criticisms. I am 
indebted also to Robert Bierstedt of Bard College, Columbia 
University, for his painstaking reading of the proofs. My 
greatest obligation is to Harry Alpert of the College of the 
City of New York, who has not only read the entire original 
manuscript, but has given generously of his time in written 
and oral discussion in the course of which he has made 
valuable criticisms and suggestions. My indebtedness to all 
of the above and to others who have shown interest in the 


revision implies, however, no responsibility on their part for 
the faults and limitations which remain. 


GEORGE A. LUNDBERG 
North Bennington, Vermont 
June 1, 1941 
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CHAPTER I 


THE THEORY AND PLANNING OF SOCIAL 
RESEARCH 3 


If there are some subjects on which the results obtained have finally re- 
ceived the unanimous assent of all who have attended to the proof, and others 
on which mankind have not yet been equally successful ; on which the most 
sagacious minds have occupied themselves from the earliest date, and have 
never succeeded in establishing any considerable body of truths, so as to be 
beyond denial or doubt; it is by generalising: the methods successfully fol- 
lowed in the former enquiries, and adapting them to the latter, that we may 
hope to remove this blot on the face of science. — 7. S. Mill. 


A. FIELDS AND METHODS OF SCIENCE 


Social scientists are committed to the belief that the prob- 
lems which confront them are to be solved, if at all, by judi- 
cious and systematic observation, verification, classification, 
and interpretation of social phenomena. This approach in 
its most rigorous and successful form is broadly designated 
as the scientific method. The principal techniques and con- 
siderations that have been found most useful in the applica- 
tion of this method to social behavior are the chief subjects 
of this book. 

The degree to which the scientific approach to social 
problems has penetrated our generation (which is frequently 
called a “scientific age”) is perhaps greatly exaggerated, It 
is true that scientific information about many subjects, par- 
ticularly about the physical world, is more widely diffused 
than ever before. It is true that the term “scientific” carries 
with it a prestige which makes it a much desired and a much 
misused label. ‘But the scientific habit of thought, especially 
about social problems, has not penetrated very far. We still 
prefer to deal with these problems in the “good” old manner 
of our primitive ancestors. That is, we like to ascribe social 
problems to the malevolence of some persons and the solu- 

1A System of Logic, (8th Edition, Longmans, 1930), p- 546- 
i 
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tion of difficulties to the benignity of others. Political con- 
flicts, including. wars, are thought of almost entirely in this 
primitive framework. ; 

The gullibility of vast numbers of otherwise apparently 
intelligent people to the propaganda, quackery, and improb- 
abilities of the day is perhaps the best index of the degree to 
which scientific habits of thought, as contrasted with a lot of 
facts about science, have penetrated. Thousands of people, 
including the so-called educated, find it quite reasonable to 
believe and act upon a radio report of an invasion from 
Mars.? In times of war, again, vast numbers of people find 
their membership in a “scientific age” no preventive to be- 
lieving the most fantastic tales regarding the diabolical per- 
sonal and social characteristics of their enemies, to say noth- 
ing of the monstrous propensities of other people’s forms of 
government. 

The congeniality of this mode of thinking undoubtedly 
springs from our long habituation to thinking about human 
affairs in terms of Devils, Gods, Evil, Sin, and other such 
concepts. By virtue of its simplicity, as Einstein has pointed 
out,’ this theological thoughtway is understandable even by 
“the most undeveloped mind.” For this and other reasons, 
this mode of thought unquestionably has given through the 
ages a certain solace to man by imposing a kind of order upon 
the universe and the promise of the ultimate achievement 
of our desires. We shall not review here the course of events 
by which this type of orientation has been superseded gradu- 
ally as regards the physical world by the scientific type of 
thinking. As regards the social world, however, our emanci- 
pation from the thoughtways of our remote ancestors is quite 
superficial. Having divested ourselves of a few individual 
examples of our Sunday-school superstitions, many of us, in- 
cluding social scientists, feel very much liberated, sophisti- 


cated, and “scientific.” The superficiality of our emancipa- 
tion from the fundamental thoughtways, of which the belief 
in ghosts, hell-fire, etc., are merely illustrations, becomes 
painfully evident, however, in any period of even moderate 
social crisis. 


7H. Cantril, The Invasion from Mars, Princeton 


1 University Press, 1940. 
3"Science and Religion,” Science News Letter, Sex 


pt. 21, 1940, p: 181. 
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At such times the agencies of propaganda have little diffi- 
culty in convincing even a large number of scientists of the 
diabolical intentions of “The Enemy” for world conquest, 
enslavement of the people, and the danger of the ultimate 
triumph of sin and evil, according to sequences specified in 
familiar thought patterns. To raise the question as to the 
probability of these eventualities, in view of the known tend- 
encies of individual and social action, is equivalent to sid- 
ing with “The Enemy.” When the crisis is over, to be sure, 
the absurdity and credulity of our recent behavior is pointed 
out by students of the subject and is widely accepted by the 
public. So superficial is the habit of rational and scientific 
thinking about social events, however, that in situations at 
all critical, even the scientifically trained promptly abandon 
this comparatively recent acquisition in favor of the “mar- 
velous articles of savage-born lore” which the conditionings 
of childhood, and generations of pre-scientific thinking have 
engrained in us in a way that it will take centuries for sci- 
ence to overcome. ' 

To change these traditional ways of thinking about men 
and social events, rather than the mere diffusion of ine: 
mation, is undoubtedly the principal task of scientific edu- 
cation. At present, enormous energy Teserves in the form 
of highly undisciplined emotions of hate and love are gen- 
erated and dissipated throughyextremely primitive instru- 
ments of social organization, consisting fundamentally of an 
inadequate technology of thought and action regarding so- 
cial problems. Brotherly love, on the one hand, and hatred 
of “evil,” on the other, are invoked in the face of problems 
that will yield, if at all, only to instruments of precision like 
“those of the surgeon and the engineer in hands that do not 
shake with fear, with anger or with love.” The reluctance 
to discipline these satisfying and virtuous emotions in our 
dealing with social questions is undoubtedly one reason for 
the slow permeation of the scientific method into the field of 
social relations. 

Another and more specific reason is the nature of our sci- 
entific education and our notion as to what science is. In- 
quiry even among college students will reveal that science 
is generally thought of as a type of subject matter rather than 
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as a method of study. To some, science means colored liquid 
in a glass tube; to others, the paraphernalia of the physics 
laboratory ; to still others, it signifies a terminology liberally 
interspersed with mathematical formulas. The idea of sci- 
ence as a particular method of study, a definite set of rules 
of procedure and of logic applicable to any subject matter 
has been neglected in our preoccupation with the startling 
findings and achievements of the better developed sciences. 
We shall use the term “science” in this book chiefly to desig- 
nate a method, although we shall occasionally use it, as is 
common practice, to designate also any body of knowledge 
arrived at by this method. 

The question as to what fields of human knowledge may 
properly be dignified by the name of science is still largely 
a matter of controversy and opinion. Some of the older and 
more highly developed fields of knowledge, such as astron- 
omy, chemistry, and physics, are conceded by practically 
everyone to be properly classifiable as sciences. Outside of 
these well charted departments, controversy over the proper 
application of the term increases in proportion to the differ- 
ence in subject-matter and the newness of the more recently 
developed fields. Of course this controversy can never be 
absolutely settled, because the criteria of science always will 
be stated in relative terms. For example, it would be im- 
possible to define exactly at what point chemistry or physics 
passed from magic and alchemy to the status of science. 

Fortunately it is not necessary for our purpose to lay down 
such absolute definitions. There are some general criteria 
of scientific method which, in spite of the differences of opin- 
ion referred to above, are sufficiently well accepted to permit 
us to know when and to what degree they have been applied 
in a given field. But when a general criterion or definition 
of science is attempted it is found that such definition tends 
to be in terms of method rather than offsubject-matter. All 
that the term “science” as applied to a particular field comes 
to mean is a field which has been studied according to cer- 
tain principles, i.e., according to scientific method. If our 
knowledge of a certain field has been derived according to 
this method, and if that knowledge is applicable to this field 
for purpose of prediction and control, then that body of 
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knowledge may properly be designated as a science, to the 
extent that it has been scientifically derived, regardless of 
the nature of its subject-matter. ‘The test of the thorough- 
ness with which the method has been applied is found in our 
ability to predict the behavior of classes of phenomena under 
given conditions. 

The attempt to define science in terms of subject-matter 
causes only confusion. The important idea to emphasize 
is that science is a unity, and that the various sciences are 
merely convenient differentiations of this unity on the basis 
of the different problems to which this common method is 
applied. Says Karl Pearson : 


The scientific method is one and the same in all branches. 

.. The unity of all science consists in its method, not in its 
material alone. The man who classifies facts of any kind what- 
ever, who sees their mutual relation and describes their sequences, 
is applying the scientific method and is a man of science. The 
facts may belong to the past history of mankind, to the social 
statistics of our great cities, to the atmosphere of the most distant 
stars, to the digestive organs of a worm or to the life of a scarcely 
visible bacillus. It is not the facts themselves which make sci- 
ence, but the method by which they are dealt with.+ 


This is a consideration also, which might be reflected upon 
with profit by those who are inclined to think of science as 
a method applicable only to the fields in which it is con- 
spicuously successful at the present time. 

Science, then, is fundamentally a technique of deriving 
reliable knowledge about any type of phenomena in the uni- 
verse and then applying this derived knowledge for the pur- 
poses of prediction and control. 


B. LEVELS OF SCIENTIFIC PROCEDURE 


We have said that broadly speaking, scientific method con- 
sists of systematic observation, classification, and interpreta- 
tion of data. Now, ‘obviously, this process is one in which 
nearly all people engage in the course of their daily lives. 
The main difference between our day to day generalizations 
and the conclusions usually recognized as scientific, lies in the 
degree of formality, rigorousness, verifiability, and general 


4 Karl Pearson, The Grammar of Science, (Third Edition, A. and C. Black, 
London, 1911), pp. 10, 12. 
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validity of the latter. This is, however, necessarily a matter 
of degree. We may for convenience recognize four degrees 
or levels of this process : * 

(a) The simplest and crudest method of research is ran- 
dom observation — merely trusting to chance that some valu- 
able datum or suggestion may turn up in the course of mis- 
cellaneous exploratory surveys, studies, or experiments. A 
great deal of social research is still in this stage and, while 
we should not be content to keep all or most of our research 
on this level, it is not to be ignored. Even the more devel- 
oped sciences are always on the alert for curious but signifi- 
cant, unexpected, or unlooked-for phenomena which may 
suggest hypotheses and more systematic investigations of 
great importance. The discovery of the x-ray is a classical 
example of the occasionally enormous significance of acci- 
dentally encountered observations. 

_ While recognizing that such random observation is some- 
times productive, we must remember that the method is 
fruitful, chiefly in suggesting hypotheses, the verification of 
which still depends upon the more formal and systematic 
methods discussed below. We must not be misled into think- 
ing that because a lucky guess sometimes turns out to be sci- 
entifically demonstrable, that guessing is therefore a way of 
arriving at scientific generalizations. Nor must we allow the 
above recognition of the possible value of random observa- 
tion to commit us to the cliché that because “all science is 
a search for truth” therefore (a frequently unspoken non- 
sequitur) all search for truth (by whatever methods) is science. 
. (b) A second method of very wide and useful application 
in research is systematic exploration of broad fields or sub- 
jects. It differs from the first method chiefly and impor- 
tantly in being systematic. This seems to be the method 


advocated by Francis Bacon in his N umeOrganum.® In 


modern times, the discovery of salvarsan byiEhrlich perhaps 


illustrates in a general way thi i 
r y this very important type of re- 
search. Most social research (e.g, community studies like 


5 Adapted from C. F $ -= 
ayc Ds A. a a he Conflicting Psychologies of Learning — A 
van „earni. lowe olo 
aan . sien of L ing and Novum Organum, (Revised Edition, Colo- 
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the Lynds’ Middletown) is probably now conducted on this 
level. 

(c) A third level of scientific research is the testing of 
well defined but isolated hypotheses, either by experiment 
or by statistical methods. An increasing proportion of social 
research is gradually attaining this level of specificity, and a 
great many of the better established generalizations have been 
arrived at or are being tested by this procedure. This level 
of research differs from the preceding chiefly in the degree 
of explicitness with which the hypotheses are stated. All 
gathering of data involves an implicit hypothesis which causes 
us to select some data while neglecting others. Thus, the 
taking of a census involves at least the hypothesis that such 
data will be useful in the future, as they have been in the 
past. On the other hand, a study of the relationship be- 
tween the education of a population and its birth rate in- 
volves a more explicit and definite hypothesis. Some of the 
recent studies on housing, mobility, morale, and projects 
testing the relationship between economic condition and 
marriage rates are examples of this type of research.’ 

(d) The fourth and most advanced level on which scien- 
tific research is carried out is found in experiments or other 
crucial compilations of data, which are directed by systematic 
and integrated theory rather than by isolated and: vagrant 
hypotheses. This method begins with (1) a set of rigorous 
and unambiguous terms to be used. (2) Next is stated a set 
of postulates or hypothetical statements and their corollaries 
(inferences from the postulates) which are tentatively as- 
sumed to be true for the purposes of the investigation. (3) 
Thirdly, theorems are stated as formal propositions which 
could or should be true if the postulates, the corollaries and 
the reasoning in the theorems are sound. These theorems 
are really hypothetical generalizations, which direct the na- 
ture of the (4) irical observations undertaken to test the 
validity of the W explicit theoretical structure outlined 
above. 4 

A classical example of this procedure is, of course, Isaac 


T See, for example, $. A. Stouffer, “Intervening Opportunities: A Err | 
Relating Mobility and Distance,” Amer. Sociol. Rev., 5:845-67, 1940; F. S. 
Chapin, “An Experiment on the Social Effects of Good Housing,” Ibid., 868- 
79; D. C. Miller, “Morale of College Trained Adults,” Ibid., 880-89. 
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Newton's Principia.’ More recent examples are found in 
the systematic theoretical development of relativity by Ein- 
stein,? and of quantum theory? by Bohr, Rutherford, Hei- 
senberg, Schrödinger, Dirac and others. A well known illus- 
tration of investigation motivated by relativity theory is the 
astronomical observation whereby it was demonstrated that 
the image of a star, whose light rays had passed close to the 
sun, showed a certain amount of displacement from its true 
position, conforming both in direction and amount with 
deductions made from the theory." 

This method sharpens the research problem and the con- 
ditions of its solution to a point where the work of different 
empirical observers can be rigorously evaluated as either cor- 
roborative or contradictory. This formal procedure also fa- 
cilitates the detection of the reason for any discrepancy found 
by different investigations or between theory and empirical 
fact. 

By this procedure, disagreements between two or more 
scientists can be resolved by a set of well defined steps rather 
than by, mere argument, recrimination, and quarreling. 
First of all, it can be determined whether each scientist uses 
the same definitions and whether each uses them consistently. 
Secondly, the logical steps can be checked against the ac- 
cepted rules of reasoning. If the deductions are logically in- 
valid, the theory is unsound, regardless of the facts. Thirdly, 
if the verification of the deductions involves conditions of 
observation which are impracticable or impossible of attain- 
ment, the theory is metaphysical rather than scientific, Fi- 
nally, if the deduced result is not corroborated when the 
conditions of the theory are fulfilled, the theory is false. A 
discrepancy between the deduced results and the observa- 
tions may, however, be due to a flaw in arty stage of this 
process — postulate, definition, reasoning, or observation. To 
bring harmony between theory and obseryation it may be 
necessary to change the postulates, definitions, and/or rea- 


ror Principles, (Trans. by F. Cajori, Univ. of California Press, 
1934). 
9A. Haas, Introduction to Theoretical Physics, Vol. 11, (Trans. from grd 
and 4th editions by T. Verschoyle: Constable & Co., London, 1925), p- 299- 
WR. Reiche, The Quantum Theory, (Trans. by H. S. Hatheld and H. L. 
Brose : E. P. Dutton & Co., New York), 199. 

n A. Haas, op. cit, p. #70. 
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soning, assuming now that the observation is reliable. Dis- 
agreement between different theories can also be identified 
and explained in this way. 

It is true that this method works best in the developed 
sciences where rigorous experiment and mathematical state- 
ment are possible. Clark Hull * has, however, done notable 
work by this method in psychology, and, as a goal toward 
which to work, this remains the ideal of scientific procedure 
in the social as well as in the physical sciences. 


C. STEPS IN THE SCIENTIFIC METHOD 


We have emphasized above that inquiry which yields 
knowledge of the type we recognize as scientific may be, and 
is, carried out on different levels of formality, efficiency, and 
perfection. On all of these levels, however, there is discern- 
ible, explicitly or implicitly, a certain series of recognizable 
and verifiable steps which taken together constitute the sci- 
entific method. They may be summarized as follows: 

1. The working hypothesis. An hypothesis is a tentative 
generalization, the validity of which remains to be tested. 
In its most elementary stages, the hypothesis may be any 
hunch, guess, imaginative idea or intuition whatsoever which 
becomes the basis for action or investigation. Even the most 
detached dream or intuition has its basis in human experi- 
ence of some kind. We must not be deluded into thinking 
that,. because of the subjective (i.e., private, unverified) na- 
ture of these experiences, they are somehow generated spon- 
taneously in the “mind.” In searching for fruitful hypoth- 
eses, we may draw upon the whole realm of poetry, literature, 
philosophy, and the large descriptive literature of sociology 
and ethnology, including the speculative theories of artists 
and penetrating thinkers who have devoted themselves to a 
deep, if informal and unsubstantiated, study of man’s social 
relationships. But a working hypothesis must be formu- 
lated as explicitly and definitely as possible. It is in its ele- 
mentary form merely a definition of a problem or the object 
of any investigation, including a formulation of what facts 


_ 2C. L. Hull and others, “Mathematico-Deductive Theory of Rote Learn- 
ing,” (Yale University Press, 1940). An outline of a project proposing to fol- 
low this procedure in anthropology will be found in G. P. Murdock, “The 
Cross-Cultural Survey,” Amer. Sociol. Rev., 5:361-70, 1940- 
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will be necessary to test the hypothesis. As such, it limits 
the study to proportions that can be handled with the re- 
sources at hand and thus prevents aimless and random gath- 
ering of data. Herein, of course, lies a grave danger. If the 
investigator conceives of his hypothesis as a proposition to 
be defended, to the exclusion of evidence that might support 
the opposite, or some other hypothesis, this, of course, viti- 
ates his work as unscientific. This danger, and the means of 
avoiding it, will be further discussed in a later chapter. 

2. The observation and recording of data. This step in- 
cludes the definition of units of enumeration and measure- 
ment. It consists of making numerous and accurate obser- 
vations with the human senses (i.e., responding to certain 
stimuli with frequency and consistency), assisted and cor- 
rected by instruments of precision. The technique of achiev 
ing accuracy and objectivity in these observations is the main 
subject of this book. Upon the accuracy of the original ob- 
servations depends the value of the refined analysis and the 
ultimate reliability of generalizations drawn from the data. 

3. The classification and organization of the data collected. 
It is only when facts are classified that uniformities, sequences 
and differences become apparent. These uniformities and 
sequences are the center of interest to science. Whenever 
uniformities and sequences can be demonstrated in any field, 
that field at once begins to come within the domain of 
science, i d : 

4. Generalization. When data have been collected and 
classified so as to reveal the patterns, uniformities, and se- 
quences, there remains the’ task of formulating a brief state- 
ment or description of these sequences in such form as tO 
apply to all similar phenomena in the universe studied under 
given conditions. This isa scientific “law.” ‘The ideal form 
of such a generalization is the mathematical formula. This 
formula is never absolute or exact. Even in the fields where 
scientific investigation has long been employed, the formulas 
and “laws” are subject to constant refinement and correction 
on the basis of more extensive and more accurate observation 
of the data. These corrections usually take place as a result 
of experimental testing of our generalizations. But the fact 
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that our formulas may not be exact does not, as we shall see, 
vitiate the value of such generalizations as a basis of scien- 
tific prediction, since the probable error is itself calculable. 
Caution must be exercised, however, in not applying gen- 
eralizations too broadly, i.e., not applying them to situations 
which differ in important respects from those for which the 
generalizations have been found valid. When this caution 
is observed, generalizations in turn become the basis for sci- 
entific theories of phenomena regarding which data or suffi- 


cient data are, as yet, wanting. 


D. COMMON MISCONCEPTIONS REGARDING THE NATURE 
OF SCIENCE AND OF SCIENTIFIC LAW 13 


Although the steps outlined above are reasonably definite 
and concrete, there are certain widespread misconceptions 
regarding them which should be noted. One of these mis- 
conceptions is that a scientific law is a so-called “natural law” 
inherent in the universe, perhaps laid down by some super- 
natural agency, and waiting to be “discovered” by some 
bright person. Thus Newton is credited with “discovering” 
the law of gravity. What he did was to describe certain uni- 
formities in the behavior of falling bodies. This generalized 
description of the behavior of phenomena under given con- 
ditions constitutes the scientific law, not the sequences, forces, 
or behavior of phenomena themselves, which, of course, ex- 
isted before Newton stated his law. In other words, a scien- 
tific law does not exist until, it is stated, although the be- 
havior and relationships which it describes may have existed 
for millions of years. Says Pearson in a brilliant passage : 


A scientific law . . . is the résumé or brief expression of the 
relationships and Ae poo of certain groups of . . . perceptions 
and conceptions, and exists only when formulated by man... 

We are thus to understand by a law in science, i.e., by a “law 
of nature,” a résumé in mental shorthand, which replaces for us 
a lengthy description of the sequences among our sense-impres- 
sions. Law in the scientific sense is thus essentially a product of 
the human mind and has no meaning apart from men. . . 

The reason we find in natural phenomena is surely put there 


13 For a more complete and adequate development of this subject, sec my 
Foundations of Sociology, ch. 4. 
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by the only reason of which we have any experience, namely the 
human reason . . . the logic man finds in the universe is but the 
reflection of his own reasoning faculty.1* 5 


A further confusion as to the meaning of law in science 
results from applying the term to propositions of pure the- 
ory or logical truisms as well as to generalizations from em- 
pirical observations. Thus, such a proposition as the “law” 
of inertia, to the effect that matter will remain at rest unless 
some force acts upon it, is obviously true if we define force 
as that which produces motion. Likewise, Spencer's “law” 
that social motion follows the line of least resistance is obvi- 
ously true if “the line of least resistance” is always defined as 
that line which the motion actually did take as contrasted 
with other possible lines it might have taken. Such scien- 
tific tautologies, truisms or axioms may be valuable as hy- 
potheses or postulates in giving framework and direction to 
investigation. They should not be confused, however, with 
empirical laws derived from the statistical generalization of 
empirical observations, describing how phenomena will ac- 
tually behave under stated conditions. As Hutchison! has 
said : 


The prevailing tendency to call propositions of pure theory 
“laws” is misleading and inappropriate, and appears to be a sur- 
vival from eighteenth-century rationalist philosophy, and theol- 
ogy. .. We suggest that the term “law” should be reserved 
only for those empirical generalizations such as Pareto’s or 
Gresham’s law or the law of diminishing returns, or diminishing 
marginal utility. It is such’ laws as these that it is the central 
object of science to discover. 


A second serious misconception of the meaning of scien- 
tific law is that which holds it to be an exact statement of 
uncontrolled behavior under all conditions. As a matter of 
fact, scientific laws are nothing of the sort. They merely 
state how phenomena behave under stated conditions. ‘These 
conditions are sometimes highly abstract and artificial, and 
do not obtain anywhere in the uncontrolled universe. Thus, 
to use Bernard’s illustration, the law of falling bodies de- 


14 Karl Pearson, op. cit., pp. 82, 86, 87, 91. 


ast 9 
15 T. W. Hutchison, The Significance and Basic Postulates of Economic 
Theory, (Macmillan, 1938), pp. 62, 64. 
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scribes how bodies fall in a vacuum, at sea-level2® In other 
words, only under artificial conditions does this law hold 
good as a description of the behavior of falling bodies. The 
fact that these conditions nowhere obtain in the natural uni- 
verse does not in any way vitiate the value of this law for 
scientific purposes. It furnishes a norm or a principle from 
which variations may be computed or estimated. 

A third misconception regarding the nature of science and 
scientific law is the idea that it is the function of science to 
“explain” behavior, in the sense of accounting for it in terms 
of first causes. This is, of course, a pretension which no sci- 
entist has ever advanced, and which no science has ever ac- 
complished. All that science pretends to do is to describe 
certain sequences in the behavior of the phenomena it stud- 
ies. In the physical sciences, a description of these sequences 
is usually accepted as constituting the only kind of explana- 
tion we know anything about. But in the social studies, we 
find learned discussions of the limited usefulness of the 
methods of natural science because all science can do is to 
describe, whereas in the social sciences we must also have 
explanation in the sense of some ultimate cause. This is a 
survival of a pre-scientific mode of thought. The only ex- 
planation known to science is conceptualized description. 

As a matter of fact that is what all explanation consists of. 
We say that anything is “explained” or “understood” when 
we have reduced a ‘situation to elements and correlations 
with which we are so familiar that we accept them as a mat- 
ter of course so that our curiosity rests. By “element” we 
mean any component which we do not consider it necessary 
or possible further to analyze. Understanding a situation 
means, from the operational point of view, discovering fa- 
miliar elements, and correlations between them.%7 

With reference to more “ultimate” kinds of explanation, 


Veblen’s remarks are pertinent : 


18L. L. Bernard, “The Objective Viewpoint in Sociology,” Amer. Jour. 
Sociol., 25:307, 1919. This is 9 very frequently overlooked point. See W. M. 
Kozlowski, “The Logic of Sociology,” Amer. Jour. Sociol., 33:912-21, 1928. 
Also Eleanor Bisbee “Objectivity in the Social Sciences,” Phil. of Sci., 4:371-82, 


1937. , 
Wop. P. W. Bridgman, The Logic of Modern Physics, (Macmillan, 1932), 
p. 37. 
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Questions of a primordial beginning and a definite outcome 
have, fallen into abeyance within the modern sciences, and such 

uestions are in a fair way to lose all claim to!consideration at 

e hands of the scientists. Modern science is Basing to occupy 
itself with the natural laws—the codified rules of the game of 
causation —and is concerning itself wholly with what has taken 
place and what is taking place... The sciences which are in any 
peculiar sense modern take as an [unavowed] postulate the fact 
of consecutive change. Their inquiry always centers upon some 
manner of po . . . and comes to rest provisionally when it 
has disposed of its facts in terms of process. But modern scien- 
tific inquiry in any case comes to rest only provisionally ; because 
its prime postulate is that of consecutive change and consecutive 
change can, of course, not come to rest except provisionally. By 
its own nature the inquiry cannot reach a final term in any 
direction. So it is something of a homiletical commonplace to 
say that the outcome of any serious research can only be to make 
two questions grow where one question grew before.18 


E. THE POSSIBILITY OF A SCIENCE OF SOCIOLOGY 


The steps and considerations outlined above are generally 
recognized as the essence of scientific method. The validity 
of this method for the understanding and control of physical 
phenomena is generally conceded. If one casts even a super- 
ficial glance over modern western civilization there is one 
fact which stands out above all others in significance, namely, 
the astonishing development of physical science and its em- 
ployment in the control of the physical world, ~ 

The same glance reveals the relatively chaotic and uncon- 
trolled state of affairs in the field of human relation’. Every 
day there is a new tale of conflict, bloodshed, disorder, crime 
and misery, owing to man’s imperfect control of his relations 
with his fellowmen. It is only very recently that the idea 
has gained currency that perhaps the method which has been 
found so successful in bringing the physical world under 
man’s understanding and control might also be a profitable 
one to pursue toward the understanding and control of so- 
cial relationships. 

With the development of this idea, the question as to the 
applicability of this method to social phenomena has become 
of increasing interest. In the controversy and speculation on 


18 Thorstein Veblen, The Place of Science in Modern Civilization, (Viking 
Press, 1932), pp. $2, 33, 37, 38. 
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this point, a number of objections are presented by those 
who take the negative view, which seem to them conclusive 
evidence of the impossibility of a social science in anything 
like the sense in which the term is employed in chemistry or 


physics. Therefore, a careful consideration of these objec- 


tions is desirable at this point. 

Most of the objections against the idea of exact social sci- 
ence or sciences flow from the feeling that there are certain 
inherent differences in physical and social data precluding 
or largely invalidating the employment of the scientific 
method in the study of social phenomena. This objection, 
as generally held, reflects the deep-seated notion, already re- 
ferred to, that science is, after all, a body of subject matter 
rather than a method. The more specific and detailed ob- 
jections are to a certain extent corollaries of this central 
issue. It will be well to consider further, therefore, the dif- 
ferences between social and physical phenomena which are 
alleged to be fatal to the formulation of scientific laws of 
social behavior. 

To begin with, the history of science furnishes some highly 
suggestive and illuminating data on the question of the ap- 
plicability of scientific methods to new fields. The question 
as to what is the proper or possible field of science is not a 
new one. It is as old as science itself. It appears that the 
proper and possible field of science has always been defined 
SO as to coincide strictly with the domain within which sci- 
ence has, at the time, established itself and secured recogni- 
tion. Says Pearson: Py i 


Wherever science has succeeded in ascertaining the truth, there, 
according to the school we have referred to, are “the legitimate 
problems of science.” Wherever science is yet ignorant there, we 
are told, its method is inapplicable; there some other relation 
than cause and effect (than the same sequence recurring with the 
like grouping of phenomena), some new but undefined relation- 
ship rules. In these fields, we are told, problems become philo- 
sophical and can only be treated by the method of philosophy. . . 
Now it will, I think, be found that the fields of inquiry, where 
science has not yet penetrated and where the scientist still con- 
fesses ignorance, are very like the alchemy, astrology, and witch- 
craft of the Middle Ages. Either they involve facts which are in 
themselves unreal — conceptions which are self-contradictory and 
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absurd, and therefore incapable of analysis by the scientific or 
any other method — or, on the other hand, our ignorance arises 
from an inadequate classification and a neglect of scientific 
method.!? $ 


So much for the general barring of scientific method from 
certain fields on the ground of fundamental differences in 
the phenomena.” But there remain some more specific ob- 
jections based on apparent difficulties in the application of 
scientific method of social phenomena. 


F. THE COMPLEXITY OF SOCIAL DATA?! 


Perhaps the most frequently urged obstacle to a true sci- 
ence of human group behavior is the complexity of its sub- 
ject matter. Human groups in their tremendous sensitivity 
to physical and social stimuli and in their enormous cultural, 
psychological, and temperamental attributes which condition 
their behavior, appear to be such an inextricable combina- 
tion or confusion that the mind of man is staggered in its 
attempt to read order, sequence, and law into the behavior 
of a social group. Stuart Chase has imagined a man’s view 
of North America from an airplane in these words: 


He would only see the farms, the forests, the mines, the rail- 
roads and highways, the rivers, canals, transmission lines; the 
factories, warehouses, stores, schools, libraries, theatres, golf 
courses, and homes ; the behavior of some 100,000,000 men, wom- 
en and children in relation to these things. Men digging, and 
plowing, pulling the throttles of engines) balancing on steel 
girders, painting signboards, holding steel under a drill, wrap- 
ping up packages, driving trucks, bending over desks, talking 
through telephones, jamming people into elevated train doors, 
fishing on the high seas, fighting forest fires, pumping oil wells, 
reading newspapers, yelling at ball games, sleeping, eating, love- 
making, going to church, dancing, swimming, climbing moun- 
tains — pacing in a striped suit through prison corridors. Women 
minding spindles and babies and cook stoves, playing Mah Jong, 
drinking tea, smoking cigarettes. Children answering the school 


19 Karl Pearson, op. cit., pp. 18, 22, 23. 

20 For an elaboration of this position, see my Foundations of Sociology, 
pp. 9-22. 
21 An excellent elaboration of this subject may be found in two articles by 
Read Bain, “The Concept of Complexity in Sociology,” Social Forces, 8:222-81, 
1929, and 369-78. 1930. 
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gong, twisting in their seats, rushing through the playground, 
working in cotton mills, tossing with fever. . . vast con- 
glomeration of human activity.?? 

Does it not seem unreasonable to hope that the mind of man 
can ever reduce this chaos to order, and define this extraor- 
dinarily complex surging and milling of the human ant-heap 


into fairly exact scientific laws? On what grounds may such 


optimism be entertained ? 

In the first place, there is the fact that in all its complexity 
there is nevertheless to be observed, even by the casual stu- 
dent, certain patterns, sequences, and order in the multitu- 
dinous activities of a social group. Its behavior is by no 
means haphazard, random, and impossible to predict. On 
the contrary, if we take a group of which we are a part or 
which we know well, we find that most of its behavior is of 
a highly routine, uniform and predictable nature. Given 
certain conditions of a situation, we find ourselves able to 
predict with a high degree of accuracy what a group will do. 
We find that it tends to do certain things at certain times, 
e.g., eating, sleeping, going to church. It tends to wear cer- 
tain kinds of clothes on certain occasions. As a matter of 
fact, our own activities and social organization in general is 
founded upon this tendency of a social group to behave in a 
definite manner under given conditions. 

We find ourselves, therefore, making very accurate esti- 
mates and predictions as to the future behavior of human 
beings and of social groups. Thus, the behavior observed 
from the airplane, while it may seem complex, is not chaotic 
after we have studied it a little. It is chaotic, disordered, 
“meaningless” only to the superficial observer. The more 
we study it, the more orderly and predictable this behavior 
becomes. What is a scientific law, but a brief description of 

Stuart Chase, The Tragedy of Waste, (Macmillan, 1930), pp. 17, 18. It is 
pertinent to know that the physical scientists are equally im ressed with the 
complexity of their subject matter. Thus, Sir Ernest Rutherford in his presi- 
dential address to the British Association for the Advancement of Science 
(1929) said : “When we consider the extraordinary onpa of the electronic 
ym we may be surprised that it has been possible to nd any order in 


e apparent medley of motions.” Quoted in H. Buckley, A Short History 
of Physics, (Van Nostrand, 1928), p. 254- 
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how phenomena behave under given conditions? This is 
not only possible but has been practiced in a more or less 
systematic way from the very beginning of society. All that 
social science aims to do is to change this practice from a 
rule-of-thumb procedure to an objective and more exact 
practice permitting corroboration. 

The fact that the hopeless complexity and chaos of social 
behavior tends to disappear with closer study perhaps sug- 
gests the real nature of the complexity generally urged as an 
insuperable obstacle to an exact social science. Any situa- 
tion or behavior of phenomena is complex when we do not 
understand it. Complexity is always relative to our under- 
standing or mastery of a given behavior. To the baby who 
has not yet learned to walk or speak, the perambulations and 
conversation of its elders constitute behavior of incredible 
complexity. Complexity is not definable in absolute or ob- 
jective terms. It is a subjective measure of the degree of 
adjustment to a situation. The complexity of human soci- 
ety, in other words, is largely a function of our ignorance 
of it. 

It is true that the number of variables involved in some 
situations is greater than in others, and that complexity may 
be meaningfully defined in these terms. But the definition 
of a situation is itself an act of discrimination with reference 
to the adjustment techniques we possess. In short, the com- 
plexities of nature have been approached both by simplify- 
ing the situations and by developing techniques for handling 
increasingly large numbers of variables. The maturity of a 
science is measured by the degree to which it has succeeded 
in either or both of these respects. If some problems of 
physics today have fewer variables than some problems of 
sociology it is because physicists have found it advisable to 
exercise modesty in the scope of the problems which they set 
for themselves. They have been content to work patiently 
on a number of small but related problems. But their efforts 
in the aggregate have produced the magnificent synthesis of 
physical science as a whole. Social scientists tend to be im- 
patient of painstaking study of limited problems, and incline 


rather toward the grandiose methods of the philosophy of 
history. 
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G. THE “SUBJECTIVITY” AND “INTANGIBILITY’” OF 
SOCIAL PHENOMENA 


Perhaps the most deepseated ard widely held reason for 
insisting on the fundamental difference between social and 
physical data is the idea that physical phenomena may be 
known directly through the “senses,” whereas many impor- 
tant social phenomena are known only symbolically through 
words representing such phenomena as tradition, custom, at- 
titudes, values and the whole realm of the so-called subjec- 
tive world. If this idea were true, it would indeed make an 
important difference in the two classes of data, for it would 
mean that we could have much more direct and objective 
knowledge of physical data because of their intrinsic quali- 
ties. But psychological investigation does not bear out the 
idea that our knowledge (in the sense of the power of ma- 
nipulating symbols which represent sense experience) of 
physical data is secured in any way essentially different from 
the way we gain our knowledge of social data. We can 
- have no knowledge of either except through the operation 
of the symbolic behavior mechanisms, usually the language 
mechanisms. 

The important thing to remember is that such words as 
tradition, custom, thought, feeling, etc., stand for observable 
human behavior of some kind, and are therefore subject to 
scientific study by the same general methods as other be- 
havior. It is true that different techniques and instruments 
have to be used for the observation of different types of be- 
havior. Some behavior we observe directly with the naked 
eye and report in folk terms. Other behavior is observable 
only through instruments, for example, microscopes and 
telescopes, Still other types of behavior are observed through 
listening through stethoscopes, reading thermometers, or by 
reading the records of lie detectors, of machines that record 
“brain-waves,” etc. These readings are reported, communi- 
cated, and corroborated by means of special words invented 
for the purpose, such as degrees of temperature, etc. f, 

This point of view also resolves the old problem of in- 
trospection and the question as to whether observations of 
oneself can be used as scientific data. No one questions the 
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scientific validity of a physician's readings of his own tem- 
perature any more than we question his reading of other 
people’s temperatures. His observations are acceptable in 
either case only if he communicates them to us in a language 
which permits us to check his findings. The same rule 
would, of course, hold for his observations of his most inti- 
mate thoughts and feelings. When we develop techniques 
of observing and reporting “subjective” phenomena so that 
observations can be communicated and corroborated, they 
are as proper objects of scientific study as any other data. 
“Objective” and “subjective,” therefore, turn out to be not 
intrinsic qualities of different types of phenomena, but rather 
a designation of the degree to which we have developed 
checkable means of reporting our observations.*# 

Thus it appears that the differences between the data of 
sociology and of the physical sciences are more apparent than 
real. We do not know physical data any more directly or 
objectively than social data, except in so far as we have de- 
veloped more adequate instruments and symbols for record- 
ing the former. The uniformities in physical data which 
form the basis of all physical sciences are also observable in 
social data. As we have noted, scientific laws specify certain 
simple and frequently artificial conditions under which phe- 
nomena behave in a certain way. Under these conditions, 
the behavior can be predicted with a high degree of accu- 
racy. Likewise we can predict, with a high degree of accu- 
racy, how many people in a given city will be born, will die, 
commit suicide, or get married during the coming year, pro- 
vided, the significant conditions obtaining during the past 
years on which our observations are based remain the same. 
These conditions must always be specified in our prediction, 
and might be numerous and involved. Long and patient 
study of the environment will be necessary before we can 
specify all the significant conditions. But such reservations 
are the sine qua non of all scientific laws. The difficulties, 
therefore, which appear to preclude the possibility of a true 
science of society, derive from our undeveloped technique 


23 For a brief elaboration of this subject, see my Foundations of Sociology, 


pp. 17-22. References to a large literature on the subject will also be found 
in this source. 
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and methodology of study and our consequent unfamiliarity 
with the data, rather than from inherent differences in the 
data themselves. 


H. PREDICTABILITY OF SOCIAL BEHAVIOR 


A second common obstacle frequently urged against the 
possibility of the development of a true science of group be- 
havior is the apparent unpredictability of human behavior 
owing to our time-honored conception of the person as a 
free agent. Social phenomena are alleged to be subject to 
unpredictable “whims,” due to the “wills” and “minds” of 
human beings. Fortunately, the behavioristic approach in 
psychology is rapidly rendering obsolete these quaint meta- 
physical terms in their historic meaning. The behavior of 
human beings is recognized as being caused and conditioned 
in the same general way as the behavior of other phenomena. 
If from the top of a tower we drop a feather, it falls in a very 
irregular and unpredictable way. The way in which it falls 
and the rate at which it falls are determined by a large num- 
ber of factors, some of the more obvious ones being wind, 
moisture, and the size and shape of the feather. Still we 
have not hesitated to apply to it the law of falling bodies, 
which, as indicated before, describes the falling of bodies 
only under certain artificial conditions. The unpredictable 
gyrations, eddies, and spirals with which the feather falls 
might be attributed to “whims” and “will” with as much (or 
as little) justification as the behavior of social groups is at- 
tributed to such causes. The observed behavior is in both 
cases the result of environmental factors playing upon a par- 
ticular type of object. In both cases the environment and 
the object may be so simplified, artificialized, and standard- 
ized as to admit of a brief description of the behavior of the 
object under these conditions. This is all that the require- 
ments of a scientific law demand. 

The apparent unpredictability of group behavior, there- 
fore, is due to our present limited knowledge of the nature 
of the stimuli and responses operative in such groups. Some 
of the simpler chemical reactions were considered unpre- 
dictable some centuries ago for the same reason. Painstaking 
observation and classification of these reactions under given 
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conditions, however, yielded our present chemical laws. The 
same method has already yielded us some power of generali- 
zation and prediction of group behavior. There is every 
reason to believe that we can vastly extend this power. 

We have already referred (p. 20) to the fact that our every- 
day adjustments to our fellowmen are carried out on the 
basis of highly reliable predictions of what other people will 
do. The task of science is to develop more formal and gen- 
erally reliable methods of such prediction. Encouraging 
progress has already been made in this direction. Actuarial 
methods of determining the probability of death, marriage, 
twin or triple births, divorce, etc., for different age groups, 
economic classes, etc., have been familiar for some time. 
More recently the same general approach has been success- 
fully adopted in predicting success or failure in college, in 
marriage, and in probation and parole. The main prob- 
lem here as in all scientific prediction is to secure reliable 
and sufficiently extensive records of past experience under 
given conditions. The best if not the only way to secure 
such data is through systematic and comprehensive registra- 
tion of these events as they occur. Only in this way can we 
modify our predictions according to changing experience. 
Data on other types of social behavior which do not lend 
themselves to systematic registration must, of course, be gath- 
ered by whatever methods are found feasible.25 


I. QUANTITATIVE METHODS IN THE SOCIAL SCIENCES 


Finally, there is the contention that exact science tends to 
become increasingly quantitative in its units, measures, and 
terminology, while most of the important subject-matter of 
the social sciences is qualitative, and does not admit of quan- 
titative statement. The implied assumption in this position 
is, of course, that some kinds of data are intrinsically quali- 
tative, subjective, immeasurable. 

The untenability of such an assumption in the scientific 
universe of discourse must be evident from even a slight fa- 
miliarity with the history and methods of science. Such 


24 See Suggestions for Further Study at the end of this chapter for references 
to this material. 

25 For elaboration of this subject, see my Foundations of Sociology, pp- 
142-44. : 
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words as quantitative, measurable, objective, complex, homo- 
geneous, etc., must be regarded, under the postulates on 
which science proceeds, not as inherent characteristics of 
data but as designations of certain ways of responding and 
communicating responses.?¢ 

The assumed conflict between qualitative and quantitative 
methods must, then, be abandoned in favor of the view that 
these terms merely represent different stages of refinement 
and objectivity in our technique of description.?? When our 
knowledge is of a general, unanalyzed kind, we tend to de- 
scribe it in subjective and qualitative terms on the basis of 
the feelings which our perception of the facts evokes in us, 
or perhaps in terms of social evaluations according to exist- 
ing norms. Thus things may be described as hot or cold, 
red or green. But these terms mean different things to dif- 
ferent individuals ; and, as we have seen, one of the essentials 
of science is that its descriptions shall mean the same to all 
normally constituted minds. To eliminate the indefiniteness 
of subjective terminology, the general tendency of science 
has been to reduce the qualitative gradations to quantitative 
gradations. Thus hot or cold are described in terms of a 
thermometer scale ; color gradations in terms of light waves; 
and so on for all the qualities which have become relevant 
or important in a problem. But in the early stages of our 
thinking about a field, our descriptions will tend to be quali- 
tative. As Mitchell has pointed out: 


“, .. Even in the work of the most statistically minded, quali- 
tative analysis will keep a place. Always our thinking will cover 
a field larger than our measurements ; the preconceptions that 
shape our ends, our first glimpses of new problems, our widest 
generalizations, will remain qualitative in form. Indeed qualita- 
tive work will itself gain in power, scope and interest as we make 
use of wider, more accurate and more reliable measurements. . . 


26 Ibid., . 197-42. s 
21 Cf. HOEN Op: cit., pp. 68-69, “Leaving out of account, then, the kind 


of prognoses given by propositions of pure theory, it does not seem possible 
to give the terms “qualitative’ and ‘quantitative’ as applies to prognoses any 
more sense than ‘less accurate’ or ‘more accurate,’ and we suggest that their 
replacement by these terms would be less confusing.” 

Cf. also S. C. Dodd, Dimensions of Society, (Macmillan, 1941), ch. 1, sec. 4, 
for a penetrating analysis of the nature of quantitative as contrasted with 

ualitative symbols and an exact boundary between the two. See also Dodd's 
ch. g sec. 4B. 
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In the thinking of competent workers, the two types of analysis 
will cooperate with and complement each other as peacefully in 
economics as they do in chemistry.” ?8 


But this concession as to the place of qualitative analysis 
must not be interpreted as meaning that the distinction be- 
tween them is unimportant. On the contrary, it emphasizes 
the view that, for the more exact descriptions required by 
science, the quantitative statement is necessary. Herein lies 
the significance of this discussion for the social sciences ; for 
up to the present time they have consisted largely of quali- 
tative generalizations. The explanations and descriptions 
consist of lists of “desires,” “wishes,” “wants,” “drives,” etc. 
Professor Mitchell’s prophecy regarding economics would ap- 
ply with even greater force to sociology: “If my forecast is 
valid,” he says, “our whole apparatus of reasoning on the 
basis of utilities and disutilities, or motives, or choices, in the 
individual economy, will drop out of sight in the work of the 
quantitative analysts, going the way of the static state. The 
‘psychological’ element in the work of these men will con- 
sist mainly of the objective analysis of the economic behavior 
of groups. Motives will not be disregarded, but they will be 
treated as problems requiring study instead of being taken 
for granted as constituting explanations.” 2° Only when the 
quantitative stage is reached do our generalizations begin to 
partake of the nature of advanced or exact science. 

The qualitative worker, therefore, is not open to criticism 
for the mere fact of being qualitative in his approach, espe- 
cially in the newer fields. Work of the utmost value may be 
performed through this approach. But he is open to criti- 
cism when he contends that this is the only method appli- 
cable to his subject matter, and that therefore his qualitative 
descriptions represent the only description possible. He may 
be entirely sincere in his skepticism regarding the possibility 
of statistical restatement of his generalizations, and in the 
form in which he has stated them it may indeed be entirely 
possible that at present they are not susceptible to statistical 
attack. Says Mitchell: “Indeed . . . there is slight prospect 

28 Wesley C. Mitchell, “Quantitative Analysis in Economic Theory,” Amer. 


Econ. Rev., 15:12, 1925. 
29 Ibid., p. 5. 
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that quantitative analysis will ever be able to solve the prob- 
lems which qualitative analysis has framed, in their present 
form. What we must expect is a recasting of the old prob- 
lems into new forms amenable to statistical attack. In the 
course of this reformulation of its problems, economic theory 
will change not merely its complexion, but also its content.” 3 

What will be the nature of this change of content in soci- 
ology? As stated above, the present content consists largely 
of (1) a list of “desires, ut 
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wishes,” “forces,” “interests,” or 
“drives” under the urge of which social groups are alleged 
to engage in their varied activities; and (2) a bold forcing 
of the data into these categories on the basis of observed so- 
cial results of the activities: The difficulty, of course, as 
Mitchell has pointed out, is that “these interpretations are 
something which the theorist adds to the data, not something 
which he draws out of them.” As a result, a given behavior 
is frequently classifiable into any or all of the given cate- 
gories, according to the prejudice of the classifier. The new 
content will consist of the statistically summarized results ot 
an analysis of a large number of behavior records in all the 
fields of social behavior in which we happen to be interested. 


J- QUANTITATIVE METHODS AND CASE WORK 


But how are such generalizations to be useful in individ- 
ual cases? This is the fundamental objection of many prac- 
tical social workers. “No two cases are alike,” objects the 
social worker; “the technique to be employed in different 
cases is tremendously variable. It varies with every detail of 
my personality and that of my client. It has to be momen- 
tarily adjusted on the basis of actual developments in the 
course of the interview. How can statistical studies of large 
numbers of cases, resulting in some ‘average’ or type, be of 
value in such practical situations ?” 

Objections of this kind can be regarded as resulting only 
from a misunderstanding of what scientific description and 
statistical technique imply. The facts as stated are, of course, 
true. But wherein are such practical problems of diagnosis 
and treatment in any way different from the problems of ap- 
plied physical science? Is not the physician confronted with 


% Ibid., p. 3. 
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the same situation? Of course, no two cases are exactly 
alike in all respects. To be sure, treatment must be modified 
on the basis of observed reactions during its course. But 
who would contend that the principles of diagnosis and 
treatment on which the physician works are therefore use- 
less? And how are these principles developed? They are 
arrived at by the careful recording of large numbers of dif- 
ferent cases and their responses under treatment —a statisti- 
cal generalization. 

As a matter of fact, whether they know it or not, all case 
workers and technicians do. employ an elementary or crude 
form. of statistical method in arriving at the generalizations 
from"which they undertake to make diagnoses and carry out 
treatment. They are sometimes quite subconscious generali- 
zations based on the technician’s memory of the similarities 
in other cases with which he has had experience. Consider, 
for example, this statement by an able scholar: “. . . It seems 
to me that the argument for prediction is bedeviled by a 
failure to recognize two kinds of prediction: (a) actuarial 
prediction concerning whole populations of people and (b) 
individual prediction concerning the possibilities of action 
for a single person. . . The displacement of the psychological 
point of view is the chief fallacy of statistical social science. 
. . » The only way in which we can predict the chances [sic] 
that a given individual has of behaving in a certain way is 
to study him as an individual and especially his subjective 
mental processes with the aid of subjective categories.” * 

That actuarial predictions do not tell which individuals 
will be involved in the predicted behavior is generally rec- 
ognized by scientists. It does not follow that the chances of 
a given individual's behaving a certain way can be deter- 
mined by other than actuarial methods. Study, if you will, 
a given individual as thoroughly as can be imagined, includ- 
ing his most subjective or subconscious mental processes, 
with the aid of as many subjective categogies as anyone de- 
sires: What possible basis for prediction does all this mate- 
tial provide except a basis for classifying the case as more or 

81 G, W. Allport in Critiques of Research in the Social Sciences 1, “An Ap- 


praisal of Thomas’ and Znaniecki’s The Polish Peasant in Europe and Amer- 
ica,” (Social Science Research Council, New York, 1939), p. 185. 
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less like other cases with which the analyst has had experi- 
ence (directly or indirectly thru reading about cases, laws of 
mechanics, physiology, psychology, or what not), all of which 
represent a formal or informal actuarial basis of every pre- 
diction regarding “the chances that a given individual has of 
behaving in a certain way’? What possible basis of predic- 
tion could the most intimate knowledge of a case provide, 
unless the predicter could interpret this knowledge in terms 
of knowledge of other cases of behavior??? The value of this 
intimate knowledge of the case is not denied. Indeed, such 
knowledge is a necessary basis for refined statistical classi- 
fication and interpretation. We merely point out that this 
knowledge gives predictive power only because of the refined 
classification it makes possible in relation to other cases for 
which more or less reliable probabilities have already been 
formally or informally worked out. Accordingly, this objec- 
tion to quantitative methods from practical workers reduces 
itself to the question whether the generalizations from which 
we must proceed should be arrived at by rule-of-thumb meth- 
ods, or through the definitely quantitative and objective pro- 
cedure of statistical methods. With the proposition so stated, 
even the most vigorous critic of the statistical method will 
scarcely have the temerity to argue. The larger implications 
of this point for social case work will be treated in a subse- 
quent chapter (Chapter XI). 


K. LABORIOUSNESS OF QUANTITATIVE RESEARCH 


While it may be conceded that most of the objections 
which qualitative workers urge against the quantitative ap- 
proach to social problems flow from a sincere conviction 
that a statistical approach is impossible, some of the objec- 
tions derive from less defensible motives. Certainly one of 
the chief objections to quantitative studies in some quarters 
is the heavy burden of routine labor usually involved. As 
Mitchell has pointed out, “a qualitative worker requires 

52 Possibly one reason for the arguments on this point may be a confusion 
as to what is the unit of observation or “case” in any given instance. The 
recurrence of a tic, (or other spasm), for example, may be highly individual to 
Some person, and can be predicted from a study of a large number of occur- 
rences of the tic in this person. But the unit of observation or “case” is 
clearly the tic rather than the individual, and prediction follows the usual 
actuarial methods. 
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hardly any equipment beyond a few books and haftdly any 
helper except a typist. A quantitative worker needs often a 
statistical laboratory, a corps of computers, and sometimes a 
staff of field workers.” ** This is not to say that quantitative 
studies possess any virtue because of their laboriousness. In- 
deed, more elaborate quantitative studies than are necessary 
to secure results adequate for the purpose at hand are merely 
a pretentious kind of inefficiency and waste. On the other 
hand, we should beware of the facile criticism that painstak- 
ing quantitative studies show only what “everyone” (or at 
least the wise) already “knows.” Statistical studies do fre- 
quently show only what was generally suspected, but call at- 
tention to the degree to which the suspicion was correct and 
under what conditions the facts are as “everyone knows.” 
The distinction is an important one, because social scientists 
frequently fail to distinguish between what they know (as 
science defines knowledge, i.e., demonstrable fact) and what 
they “know” as a matter of private conviction, belief and 
feeling. We may profitably recall the words of Pearson in 
this connection : 


“There is a certain school of thought which finds the laborious 
process by which science reaches the truth too irksome; the 
temperament of this school is such that it demands a short and 
easy cut to knowledge, where knowledge can only be gained, if 
at all, by the long and patient toiling of many groups of workers 
perhaps through ea centuries,” 34 


The laboriousness of scientific research will be fully ap- 
preciated only by those who have experienced it. Some idea 
of what it involves in the social sciences will be secured from 
the chapters that follow. Only when an actual project is 
carried .through, however, will the “dirty work” and the 
“headaches” of actual research be fully appreciated. Such 
dismal days and months of routine labor require courage, per- 
severance, and unflagging devotion to scientific ideals. The 
satisfaction of “seeing how things come, out” is also great ; 
and there are perhaps few students who have not felt this 
reward in some degree, whether in the solution of a problem 
in algebra or in some other field. 


33 Wesley C. Mitchell, op. cit., p. 2. 
M Karl Pearson, op. cit., pp. 17, 18. 
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The transition to quantitative methods in the social sci- 
ences is likely to be deplored by those who feel that in this 
field, as contrasted with the other sciences, only “literary” 
and philosophical modes of expression can adequately por- 
tray the subtle “human” characteristics of what is called “the 
situation as a whole.” Many of the subjects with which the 
social sciences deal have been traditionally the province of 
the preacher, the reformer, the journalist, the novelist, and 
the dialectician. These facile practitioners of the art of ver- 
balization have sometimes nothing but ridicule for sober sci- 
entific work in the social sciences. As the line dividing so- 
cial science from literature becomes increasingly clear, athe 
difficulty will of course resolve itself in this as in other fields. 
One of the chief complaints of the literary fraternity is that 
as the sciences advance, they tend to rely on increasingly spe- 
cialized symbols and modes of expression which do not lend 
themselves to oratorical periods and flashy argumentation. 
In the meantime, Mitchell’s prediction for economics un- 
doubtedly also applies to the other social sciences : 


The literature which the quantitative workers are due to produce 
will be characterized not by general treatises, but b numberless 
papers and monographs. Knowledge will grow by accretion 
as it grows in the natural sciences, rather than by the excogita- 
tion of new systems. Books will pass out of date more rapidly. 
The history of economic theory will receive less attention. 
Economists will be valued less on their erudition and more on 
their creative capacity. The advances will be achieved not only 
by conceiving new hypotheses, but also by compiling statistics 
from fresh fields, by inventing new technical methods, by refining 
upon old measures, and perhaps by devising experiments upon 
certain types of behavior. It will be harder for anyone to cover 
the whole field, perhaps quite impossible.®5 


The inevitability of technical developments in methods 
and terminology as a science matures must be faced by social 
Scientists as it has been faced by other scientists. It is no 
longer assumed in the mature sciences that every scientist, 
not to mention the layman, should be able to read with ease 
the technical monographs of every other scientist. The tra- 
dition is still strong in the social sciences, however, that if 
the layman or the literary fraternity finds a monograph dif- 


% Ibid., pp. 6, 7. 
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ficult or unreadable, that is fairly conclusive proof of its 
worthlessness. The journalist and the social philosopher 
with reputations gained during the preceding generation are 
sometimes among the first of those who deplore any new de- 
velopments which they cannot follow without hard work. 

The reaction of Hobbes to Wallis’ work on Conic Sections 
(to take no more recent example) is illustrative. “Instead of 
describing geometrical figures and proofs in Latin sentences 
as had been the invariable custom previously, Wallis intro- 
duced the innovation of lettered diagrams for which students 
since his time may well be grateful; but Hobbes sent [the 
manuscript) back with the comment that he knew not 
whether the proofs were good or bad for it was so ‘covered 
o'er with a scab of symbols’ that he would not even read 
it!” 36 In saying that he knew not whether the proofs were 
good or bad, Hobbes took an entirely proper position and 
was remarkably frank and honest; the more common reac- 
tion would have been for him to have declared the proofs 
bad, because he could not read them, though this reason 
would usually not be admitted. Every social scientist may 
reasonably expect some day to be left behind in one or more 
of the fields of a rapidly advancing science. It requires 
moral courage and scientific humility to accept the position 
of a novice in a field where one is accustomed to being re- 
garded as an authority. But if we do not wish to obstruct 
the advance of science, such relinquishment of authority and 
leadership at the proper time is required of the scientist. It 
is proper for us to maintain a healthy skepticism and a non- 
committal attitude toward the developments which we do 
not understand. But we cannot safely condemn innovations 
merely because they are new, or because we find them diffi- 
cult, or because we do not understand them. 

We have discussed above some general characteristics of 
the scientific method as an approach to the solution of scien- 
tific questions. Before proceeding with a consideration of 
the more concrete techniques involved in this method as ap- 
plied to social phenomena, it may be well to review briefly 
some considerations which influence the selection of ques- 
tions to which the method is finally applied. 

3S. C. Dodd, op. cit, ch. 1. 


nye 


ated 


THE THEORY OF SOCIAL RESEARCH 31 


» 


L. THE SELECTION OF RESEARCH PROBLEMS 


In the most general sense a research problem is deter- 
mined by the circumstances which arouse in someone a curi- 
osity which craves satisfaction. Curiosity in its most general 
sense is merely the “intellectual” (i.e., symbolic or linguistic) 
aspect of a tension in the organism—some imbalance be- 
tween it and its environment or between the parts of an or- 
ganism. Thus scientific research is, in the last analysis, 
merely a more or less refined and abstract type of the same 
general activity which characterizes all life in its gropings 
toward adjustment within itself and with its environment. 
We reserve the name research, however, for only the more 
formal, deliberative and “intellectual” (i.e., symbolic and ab- 
stract) forms of this general adjustment process. Scientific re- 
search, furthermore, is usually thought of as an activity of 
general import to the community, rather than as a struggle 
on the part of the individual scientist to achieve a personal 
satisfaction of his curiosities, although the latter may be fore- 
most among the scientist’s incentives. 

The question whether the scientist should be primarily 
concerned with the solution of problems of more or less im- 
mediate concern to the contemporary world or whether he 
should select his problems primarily to satisfy his own intel- 
lectual curiosity is a matter of frequent controversy. Ac- 
tually, of course, the two interests are likely to coincide to a 
large degree unless the scientist is too sheltered and removed 
from the life of his community and his generation. But it is 
frequently difficult to show the layman that some types of 
Scientific research are relevant to contemporary community 
problems. In such cases the scientist frequently takes the 
position that he is a privileged person who must be allowed 
to follow “truth” or curiosity for “its own sake,” unfettered 
by the current interests or needs of contemporary life. It is 
correctly pointed out that some of the most important dis- 
coveries in science have resulted when men followed merely 
their curiosity without thought of the possible practical con- 
sequences of their work. It is doubtless true, for example, 
that the mathematical researches of Faraday and Clerk Max- 
well seemed to everyone at the time to be quite devoid of 
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practical application. It is also unquestionably true that 
these researches have been found subsequently to be entirely 
basic to our development of the whole field of radio physics. 
The same could be said regarding early experiments in elec- 
tricity and many other fields. Finally, it will be recognized 
by everyone that, if research should become too preoccupied 
with the day-to-day problems of each community, the gen- 
erally valid knowledge, which constitutes the great achieve- 
ment and content of science, could probably never develop. 
There would be no time for such “impractical,” long-run 
projects. There is real danger that an overly-practical and 
short-sighted policy in this respect might dry up the very 
fountainheads of scientific advance through a failure to sup- 
port adequately the more “esoteric” researches into theoret- 
ical and mathematical questions which have always been the 
conditions of progress in science. 

In times of public excitement especially there are likely to 
be outcries from “practical” men, reformers, and even some 
poets, as proponents of “action” against “lecturing on naviga- 
tion while the ship is going down.” It is because some men 
went about their proper business (instead of rushing at odd 
jobs about the ship, which could not be saved, or was not 
worth saving anyway), that we know something about navi- 
gation today. The researches and the lectures survived and 
were found useful on other ships on which, however differ- 
ent from the ships that sank, men found life quite tolerable 
and navigation greatly improved. 

Past experience, may be said, then, to justify the mainte- 
nance by society of a class of people of certain characteris- 
ties and training, commissioned to do nothing but follow 
their “pure” or “idle” curiosity, on the ground that the ap- 
parently purely personal and frequently esoteric pursuits of 
these men occasionally yield, as an incidental by-product, 
results of vast significance to the community and to man- 
` kind. It is important to note, however, that when we make 
this concession we do so not because of any assumed “right” 
of the intellectual to such privileges, but purely on the dem- 
onstrated probability that this policy and practice will yield 
a concrete return, however indirectly, to the community. 
In short, we may resolve the controversy between the con- ` 
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tenders for “practical” research and the “truth-for-truth’s- 
sake” contingent somewhat as follows: The “intellectual” 
is on sound ground when he stresses (a) the need of freedom 
in science to follow curiosities which have no’known rela- 
tion, or at least a very remote and hypothetical relation, to 
the problems that may be agitating the contemporary world ; 
(b) the indirect significance to “practical” problems of such 
freedom and of highly “esoteric” mathematical and other 
research in the fields of symbolic representation ; and (c) the 
justifiability of the long view in science, that is, the contem- 
plation of, and work upon problems in the perspective of 
centuries rather than in terms of individual lifetimes. Scien- 
tists and other intellectuals are, on the other hand, on un- 
sound and unrealistic ground if and when they contend for 
freedom to follow their curiosity, or “truth for truth’s sake,” 
as an end in itself rather than as a means to ends in which 
the larger community has a stake. Unless the intellectual 
can make out a reasonably probable claim that his mysterious 
activities will someday yield a return of a kind that the com- 
munity values, he has no more claim to support than any 
other member intent only on his own amusement or de- 
velopment. 

But science is perhaps in greater danger from overem- 
phasis on “practical” research (in the sense of research on 
local and immediate problems) than on “pure” research. 
It is comparatively easy to get a hundred thousand dollars 
for a social survey of a local community if immediate results 
of value are thought probable. It is very difficult to get a 
fraction of that sum for research on the development of so- 
cial measuring instruments, for example. Yet the latter, 
when developed, become the principal means of reliable and 
economical “practical” research as well as the chief means 
for the advance of science. The matter can perhaps be sum- 
marized most briefly by imagining what would be the state 
of knowledge, research, and practical efficiency in the physi- 
cal sciences if every researcher had to invent anew for each 
research project, instruments for the measurement of time, 
length, weight, temperature, etc. The corresponding state 
of affairs is virtually what today exists in the social sciences. 
There is no systematic theory which stipulates what we choose 
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to regard as fundamental dimensions of social behavior and 
there are few standardized and verifiable definitions and 
measures of the various concepts and dimensions used by 
various researchers.7 Deliberate and coordinated work on 
this key problem might properly engage some of the atten- 
tion of research foundations interested in fundamental re- 
search. 


z. 
M. PLANNING A RESEARCH PROJECT 


Turning now from the above more or less theoretical as- 
pects of the selection of research problems, we may consider 
some of the more practical considerations confronting the 
student in the selection and planning of a research project. 

If a person is employed as a professional researcher, his 
employer will tell him to what questions answers are desired, 
on the basis of the needs of the industry, the government de- 
partment, or the research institute in which the research is 
to be carried on. The same may be more or less true in 
academic research, when the director is interested in testing 
some broad theory through a series of graduate theses. If 
the problems so proposed are strategic or crucial to the fur- 
ther advancement of systematic knowledge in a field, the 
serious student will perhaps prefer to work on such a subject 
rather than try to find some unique topic which has hitherto 
escaped the interest of everyone else, perhaps because of its 
triviality. 

The thoroughly absurd notion (perhaps adopted from his- 
torical research, in which field it may have some validity) that 
graduate theses should be on some new topic not yet touched 
by other researchers, has resulted in the scattering of the re- 
search energies'of thousands of graduate students among a 
vast variety.of unrelated topics bearing on no central scien- 
tific theory at all. It is true that the principal purpose of 
graduate theses is to provide research training for the stu- 
dent, and that this may be best achieved by allowing or com- 
pelling him to formulate his own problem. This is still 

8? This work will necessarily be, to begin with at least, heuristic in nature, 
i.e., different students will propose different theories, concepts, and measures 
of which the most useful will survive in whole or part. An example of one 


of the most comprehensive and systematic of such attacks is found in S. C 
Dodd, Dimensions of Society (Macmillan, 1941). 
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possible, however, within the vast province of conflicting the- 
ories and approaches which characterize the social sciences 
today. It is not a question of confining research to one 
theory. It is rather a problem of making graduate theses 
bear on some theory so that knowledge may grow by sys- 
tematic accretion. The alternative is the mere accumulation 
of random fragments. For the same reason students should 
be encouraged to undertake theses on the same questions 
which others have already studied in order to corroborate 
or refute the previous findings. Only when the findings of 
one researcher have thus been repeatedly tested can we regard 
the results as reliable. The undeveloped state of clearly in- 
tegrated theoretical frameworks in the social sciences is, of 
course, chiefly responsible for the chaotic scattering of re- 
search in these fields. As a science matures this uncoordi- 
nated effort tends to be supplanted by more systematic re- 
search directed at more clearly defined goals. 

The skilled researcher should be able to proceed effectively 
on any significant problem, whether he is motivated with 
great personal curiosity in the matter or not. As a techni- 
cian he should find sufficient motivation in the purely tech- 
nical problems of finding reliable and demonstrable answers 
to the questions asked, even if he does not consider the lat- 
ter of great importance. The main problems of carrying on 
a research project are largely the same regardless of the pre- 
cise character of the question to be answered, and we shall 
therefore consider briefly some of these problems. 

1. When the general subject to be studied has been de- 
cided upon, whether through the researcher's own interest 
or through assignment by others, it is important to define 
and delimit as precisely as possible the as or aspects to 
be studied. This can best be done by contemplating exactly 
what kind of answer, in the sense of form and degree of 
Specificity, is desired. This does not mean that the specific 
result itself can or should be decided in advance. Above all, 
it must be remembered that in research, negative data may 


_be as important as positive data. Many research projects 


fail largely because of lack of clearly defined objectives. For 


38 The beginnings of a more systematic approach are becoming apparent in 
Psychology. See C. L. Hull, op. cit. 
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example, after a year or two of research on many such topics 
as “child life,” “women in industry,” “leisure,” “the news- 
paper,” etc., it is found that, while a vast amount of “‘interest- 
ing material” has been gathered, practically no important 
scientific questions of general interest can be answered. 
Every topic has an unlimited number of aspects and could 
be studied for unlimited periods of time. When a topic is 
suggested for research, each person usually has in mind some 
aspect of it to be studied. Others may have other aspects 
in mind. If each takes the broadly stated topic to mean the 
aspect in which he is interested, great confusion and dis- 
agreement in the appraisal of the results is likely to ensue. 
The researcher, especially the beginner, would do well not 
to undertake to investigate a wide, general topic but insist 
on an explicit statement of the aspects to be investigated. 
This can frequently best be achieved by a list of specific 
questions to which answers are desired. These questions 
become the guide to the planning of the investigation in all 
its details, including, for example, the types of tabulations 
to be employed. If a certain type of question is to be an- 
swered, a certain type of table must be forthcoming from 
the data in order to answer such a question. In order to 
produce a certain type of table, the schedule on which the 
data is recorded must be so constructed that the data neces- 
sary for tne required table will be collected in the necessary 
form. This, in turn, as we shall see later, has everything to 
do with determining the categories and units in terms of 
which the data are to be gathered. Nothing is more wasteful 
and futile, or more characteristic of the inexperienced re- 
searcher than the tendency to engage in an enthusiastic gath- 
ering of data ‘on the theory that, if only enough and suffi- 
ciently varied data are collected, any or all questions can be 
answered from the results. The researcher will find to his 
grief that too frequently his data can be used for the answer 
of even some of the simplest questions only if these questions 
were definitely in mind when the data were gathered. A 
question, then, serves as a framework for an investigation, 
and as every question implies an answer, it requires the re- 
searcher to envision at the outset the form of the answer 
which will satisfy the question with which all inquiry begins: 
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The more specific and definite that question is, the more effi- 
cient and specific the investigation is likely to be. 

This does not mean that broad and general objectives may 
not be specifically set as the purpose of a research especially 
by mature students or to survey a field of problems. There 
are occasions where such preliminary or exploratory studies 
are desirable and necessary in order to formulate the more 
specific problems that everyone vaguely feels are involved. 
The degree to which such preliminary work is necessary will 
depend greatly upon the amount of research which has al- 
ready been done in the field and upon the researcher's fa- 
miliarity with this work. In general, it may be said that stu- 
dents as well as more experienced researchers are prone to 
undertake subjects unintentionally too broad in scope and 
out of all proportion to the researcher’s resources in time and 
money. When the objectives are intentionally broad and 
general, it is important that even these objectives should be 
deliberately, planfully so, if the research is to proceed effi- 


- ciently. 


2. When the subject of a research has been decided upon 
and defined as precisely as possible, it is necessary to make 
some preliminary inquiry as to the feasibility of the under- 
taking from a number of points of view. First, the accessi- 
bility and availability of the sources must be considered. 
No matter how excellent may be the research decided upon, 


_if it depends for its fulfillment upon sources which are not 


available within the resources of the researcher, the project 
is not feasible. To determine this question, the researcher 
must have decided at least roughly what factors the prob- 
lem involves. Second, the relationship of the contemplat 

project to other projects, and especially its dependence upon 
such other projects, needs to be carefully considered. To 
the extent that the successful prosecution of the study is con- 
tingent upon factors outside the investigator’s control, the 
undertaking must be considered precarious. Finally, there 
are questions of the practical purposes and results of the 
project: (a) To whom and in what respects will the results 
be of interest? (b) Will the results bear upon new hypothe- 
ses or will the findings mainly support or refute hypothe- 
ses for which there is already much evidence? (c) Are 
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the results intended for purely scientific purposes, or as data 
for various immediate and practical uses? Any or all of 
these may be significant and valuable contributions. But it 
is important to consider a proposed research from all of these 
points of view because the methods to be employed may de- 
pend to a large degree upon these considerations—for ex- 
ample, the degrees of accuracy and verifiability required, 
public reaction to the undertaking, etc. ° 

3. Practically all research must go forward under fairly 
specific limitations of time and financial support. There is 
a more or less romantic view of research which paints the 
creative worker as one who, decade after decade, pursues the 
elusive problem he has set himself regardless of any other 
considerations. These conditions are approached in the 
cases of well established sciences and endowed research in- 
stitutes and laboratories, and they are, of course, the ideal 
conditions for the solution of fundamental problems. But 
even in such circumstances, periodic reports and budgetary 
estimates are called for. The bulk of research, however, and + 
notably social research, takes place under much less favorable 
conditions. Much of it is undertaken for administrative 
guidance and must be completed within definite tinie limits. 

The conditions under which social research is carried on ' 
are so varied as to prevent here any detailed or itemized sam- 
ple statement of budgets, either of time or money. The 
tendency for both to exceed their advance specifications 
should, however, be recognized and allowed for to, the ex- 
tent of at least five percent of the total budget. Accurate 
estimates of time and money required to carry on and com: 
plete a study depend largely on the possibility of itemizing 
carefully the time, equipment, and assistance which will be 
required. Among the items that usually must be included 
are: (1) Salaries of workers and expenses of meetings and 
conferences, if any. (2) Rental of office and equipment, 
desks, chairs, stenographic and statistical equipment, tele- 
phones, filing cabinets, etc. (3) Supplies — papers, stamps, 
printed supplies, schedules, books, maps, etc. (4) Traveling 
expenses. (5) Special costs, if any—e.g. mechanical tabu- 
lations by an outside agency, drafting of charts, etc. (6) 
Publication and distribution of the report. 
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The proportions of a budget to Be allocated to each of 
these items will vary enormously with different types of 
projects and cannot be suggested in general terms. The re- 
searcher’s own experience, and that of others who have con- 
ducted similar projects, should be consulted. The investi- 
gator’s familiarity with the circumstances under which the 
study is to be, conducted will be his most valuable aid in 
budgeting both time and money. In general, it may be said 
that the inexperienced researcher especially is likely to allo- 
cate too large a proportion of his time and money to the 
gathering of the data. Later he finds himself handicapped 
for lack of both in the analysis and presentation of the re- 
sults. 

4. The techniques of dealing with the problems that arise 
in the course of a study, especially in its first stage, namely, 
the planning of the project and the gathering of the data, 
are the principal theme of the remainder of this book. The 
more generally accepted methods of analyzing the data col- 
lected have been extensively dealt with in a large number 
of general treatises on statistics. The preparation of reports 
and manuscripts has likewise received adequate attention in 
a number of places to which references are made at the end 


_ of this chapter. Illustrations and further elaboration of 


many of the considerations here raised in outline form will 
occur throughout the volume. 


p” N. SUMMARY AND CONCLUSION 


In the foregoing pages we have taken the view that of the 


T various methods which man has employed in his age-long 


struggle to adjust himself to his environment, there is one 
method, the general type we have called the scientific 
method, which has proved itself incomparably superior to 
all others in the results it has achieved. The implica- 
tion seems clear that a method which has proved itself so 
effective in one field of human adjustment should be em- 
ployed also in those fields where man’s adjustment and con- 
trol is relatively imperfect, namely, in his social relations. 
But up to the present, man has made only feeble attempts to 
apply scientific method to his social problems. It has been 
erroneously assumed that the data of human society are so 
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fundamentally different from the data-of the physical world 
and of lower organic life as to make the method inapplicable 
to human relations. 

This general objection, we find, reduces itself into a num- 
ber of factors, all of which are more or less generally regarded 
as fatal obstacles to the formulation of true scientific laws of 
social behavior. Thus it is contended that the data are too 
complex. This is a subjective attitude toward things which 
we have not yet analyzed and hence do not understand. It 
is a condition which flows from the very inadequacy of our 
present description and definition of social phenomena. 

Second, it is contended that social phenomena are subject 
to incalculable whims due to the alleged presence in human 
beings of “‘self-active” brains, “will,” “minds,” “souls” and 
other mystical concepts of an old and rapidly disintegrating 
psychology. We have attempted to show that these whims 
are present in “natural” phenomena as well, and that scien- 
tific law meets the situation by standardizing the conditions 
or abstracting the situation so as to describe behavior under 
these conditions with a high degree of accuracy. 

Finally, there is the fear that, since scientific method tends 
to be quantitative in its description, such description of hu- 
man relations will omit various alleged intangible qualita- 
tive conditions which must be considered. A variation of 
this objection is the contention that a quantitative analysis 
of a situation destroys the relationship between the parts, 
the vital thing in society. We have attempted to show that 
there is reason to believe that the essential qualitative differ- 
ences are all reducible to quantitative statement. As for the 
contention that quantitative analysis and statement do not 
take into consideration the relationship of the parts, this-is 
true only of an imperfect or inadequate quantitative state- 
ment. A mathematical formula describes not only the 
constituent elements of a complex, but their relevant rela- 
tionships. It is not contended that at present all the relation- 
ships betwen social phenomena are susceptible to statement 
in terms of mathematical or other symbols employed in the 

» Cf. S. and B. Webb, Methods of Social Study, (Longmans, 1952): “We 


think that ‘wert’ and ‘sinn’ [value and meaning) enter no more and no less 


into one department of our scientific knowledge of the universe than into 
another.” (p. 12) 
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physical sciences. To conquer new fields, new concepts and 
tools must be invented. They must be developed and im- 
proved through experience as they have been developed in 
the mature sciences. But we must seek these concepts and 
tools in the field of the quantitative and objective, rather 
than in the domain of the qualitative, subjective, and meta- 
physical. 

In this evaluation of the objections to the possibility of 
exact social sciences, there is no desire to minimize the diffi- 
culties involved, nor to claim that much progress in their 
solution has as yet been achieved. But it is important to dis- 
tinguish between great difficulty and impossibility. In view 
of the present admittedly low development of anything prop- 
erly designated as science in the social studies, it may be ob- 
jected that the position we have taken is unjustifiably opti- 
mistic. Our optimism is justifiable, however, in the light of 
the history of science. Every extension of the field of science 
has met with exactly the same objections as are today urged 
against the possibility of scientific method in sociology. The 
prospects of exact social sciences and a consequent trans- 
formation of the social world comparable to that which the 
physical sciences have wrought in the physical world, involve 
a far lesser imaginative leap than the scientific achievements 
of today would call for on the part of the scientists of several 
centuries ago. 

Considerations of this kind are not efforts to make light of 
the difficulties confronting social scientists. These difficul- 
ties are very real; they will require years of patient and dis- 
couraging effort to solve. Some of the more obvious of these 
difficulties are considered in the following chapter. But 
while these difficulties will be emphasized, this should not be 
interpreted as a counsel of despair because of the impossi- 
bility of attaining in present social research some of the ideals 
of mature science that have been reviewed in this chapter. 
In fact, the student should be warned against an unreason- 
able quest for perfection that is unattainable. He who seeks 
complete knowledge and final accuracy in every detail will 
presently be eligible for the psychopathic ward, and, in any 
case, will never reach his goal or finish his work. This book 
will continue to emphasize the importance of accuracy, ade- 
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quacy, and caution in’ the gathering and interpretation of 
data. Yet even these watchwords are relative to the ends 
sought. For practical purposes, there should be in research, 
as in most other activities, a compromise between (a) what 
the researcher knows to be the ideal and (b) the practical re- 
quirements of the project. This compromise should be made 
at a point (a) where the results are sufficiently good for the 
purposes in mind, (b) where the results have some value even 
though imperfect, and, in any case, (c) where, under the cir- 
cumstances, one has done the best one could. In all in- 
stances, however, it is important to recognize and call atten- 
tion to the imperfection of the work which, for whatever 
reason, we are unable or unwilling to carry on to higher de- 
grees of perfection. 

For the same reasons we have recognized the value of a 
large body of literature which can hardly be said to conform 
to the formal requirements of scientific research but which 
is invaluable as sources of hypotheses and clues. These may 
suggest both the formulation and the interpretation of more 
rigorous scientific studies. The scientist has, therefore, no 
quarrel with “literature” from the most impressionistic poem, 
or novel, or life histories to the more objective descriptive 
accounts of ethnology anid community studies of a purely 
literary sort. Neither does the scientist confuse such ma- 
terial with scientific findings. It is doubtless true that the 
ultimate aim of scientific research is to arrive at scientific 
laws. But it is also true that the bulk of social research aims 
not directly at this end, but rather at immediate objectives 
of administration and community adjustments. The prob- 
lems and methods discussed throughout the remainder of this 
book will be found to apply with equal validity to both types 
of research. 


O. SUGGESTIONS FOR FURTHER STUDY 


1. The best training in scientific research is to participate 
in such work under the guidance of a capable scientist. The 
next best training is to be secured through a careful exam- 
ination of the printed accounts such men have left regard- 
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ing the details of their labors. The reading of the present 
text should, therefore, preferably be accompanied by partici- 
pation in an original research project, in the course of which 
the student may experience first hand the problems here dis- 
cussed : arriving at a decision on the problem to be studied, 
formulating hypotheses, planning observations, and collect- 


_ing data. (The analysis of the data belongs perhaps more 


directly in a course in statistics.) Whether or not such a 
project is undertaken, the student should carry on from the 
beginning of the course the study of one or more model re- 
search monographs. These will illustrate the way in which 
competent researchers have wrestled with similar problems 
in some specific cases. Such monographic studies will doubt- 
less be recommended by each instructor according to his own 
judgment of the student’s interests and maturity. The fol- 
lowing are a few excellent examples. 

(a) S. C. Dodd, A Controlled Experiment on Rural Hy- 
giene in Syria, (Oxford Univ. Press, 1934)- Page 1-38 of this 
monograph have never been surpassed as a mo el of clear or- 
derly statement of the methods of a research project. The 
few mathematical statements may be skipped by those who 
are not familiar with elementary statistics. 

(b) E. W. Burgess and Leonard Cottrell, Jr., Predicting 
Success or Failure in Marriage, (Prentice-Hall, 1939). 
Chapters 1-5 are a very fine illustration of the problems and 
methods of approaching one kind of sociological problem. 
The method is applicable to a wide range of problems and 
should be carefully studied. Other studies of the same gen- 
eral type are (c) G. Vold, Prediction Methods and Parole, 
and (d) E. Monachesi, Prediction Factors in Probation, (both 
published by The Sociological Press, Hanover, N. H., 1931 
and 1932, respectively). An excellent non-technical mono- 
graph for beginning students is (e) R. S. and H. M. Lynd, 
Middletown in Transition, (Harcourt, 1937). 

2. Elaboration of the position taken in this chapter re- 
garding the nature of science, the applicability of quantita- 
tive methods in the social sciences, and the nature of scien- 
tific laws, may be found in G. A. Lundberg, Foundations of 
Sociology, (Macmillan, 1939), chs. 1, 2, and 4, respectively. 

3. Regarding the purposes and types of problems to which 
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social scientists should devote themselves, read R. S. Lynd, 
Knowledge for What? (Princeton Univ. Press, 1939), chs. 1, 
4, and pp. 249-50. See also my review of this-book in Amer. 
Four. Sociol., 45:2'70-74, 1939. 

4. On the subject of analysis and interpretation of data 
the following texts, among many others, provide an elemen- 
tary treatment, involving no advanced mathematical knowl- 
` edge. The briefest, and, considering its brevity, the clearest 
summary of the whole subject is by Calvin F. Schmid, chs. 
11 and 12 in Pauline V. Young, Scientific Social Surveys and 
Research, (Prentice-Hall, 1939). The student entirely un- 
familiar with the field, however, will probably require a more 
elaborate treatment such as is to be found in the following: 
For students of economics and sociology, R. E. Chaddock, 
Principles and Methods of Statistics, (Houghton Mifflin, 
1925); for students of education and psychology, H. E. Gar- 
rett, Statistics in Psychology and Education, 2nd Edition 
(Longmans, 1937). The most comprehensive general trea- 
tise is F. E. Croxton and D. J. Cowden, Applied General Sta- 
tistics, (Prentice-Hall, 1940). 

5. Suggestions on the preparation of reports and manu- 
scripts may be found in W. E. Spahr and R. J. Swenson, 
Methods and Status of Scientific Research, (Harper, 1930), 
chs. 12 and 13. See also A Manual of Style, (University of 
Chicago Press, 1935); and W. L. McAtee, “On Scholarly 
Writing and Critical Reviewing,” Scientific Monthly, July, 
1940; Pp: 77-79. 

6. For a thorough discussion of the prediction of human 
adjustment, see Paul Horst et al., The Prediction of Personal 
Adjustment, Social Science Research Council Bulletin No. 
43, 1941. 


CHAPTER II 


DIFFIC ULTIES OF OBJECTIVE OBSERVATION ; 
EXPERIMENTAL TECHNIQUES 


In the most advanced communities, and even among the adepts of modern 
science, there comes up persistently the revulsion of the native savage against 
the inhumanly dispassionate sweep of the scientific quest, as well as against 
the inhumanly ruthless fabric of technological processes that have come out 
of this search for matter-of-fact knowledge. . . Furtively or by an overt 
breach of consistency, men still seek comfort in marvelous articles of savage- 
born lore, which contradict the truths of that modern science whose dominion 
they dare not question but whose findings at the same time go beyond the 
breaking point of their jungle-fed spiritual sensibilities. — Thorstein Veblen 


A. THE NATURE OF KNOWLEDGE 


Our knowledge of the world must be secured through our 
response organs, commonly referred to as our senses.? The 
correctness of our knowledge, that is to say, the similarity of 
our perceptions when compared with other people’s, depends 
in the last analysis upon the physical soundness of these 
response organs. But, unfortunately, and unlike a telescope 
or other mechanical tool of observation, the mere mechani- 
cal perfection of our response organs does not insure uni- 
formity of perception among different observers. This 
differential in the perceptions of different persons when ex- 
posed to the same stimuli derives from several sources : (1) 
the perception of all but the simpler and cruder stimuli is 
the result of the training and conditioning of our response 
organs ; (2) our responses are influenced by physiological and 
environmental conditions — fatigue, age, temperature, etc. ; 
and (3) the reception and interpretation of the stimuli re- 
ceived is determined by our “‘apperceptive mass,” that is, the 
sensitivity we have developed and the responses we have 
learned, as a result of past experience. “Men observe with 
the eye of the past.” ® 

‘4 The Place of Science in Modern Civilization, (Viking Press, 1932), PP- 
. i For elaboration of this statement, see The Philosophy of John Dewey, (P. 


A. — Editor. Northwestern Univ., 1939), p 538. See also J. F. Markey, 
The Symbolic Process and its Integration in Children, (Harcourt, 1928), pp- 


14-15. z K 
å Columbia Associates in Philosophy, An Introduction to Reflective Think- 
ing, (Houghton Mifin, 1923), p- 24- 
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B. THE NATURE OF BIAS AND PREJUDICE 


The neuro-muscular set (symbolic mechanisms), which 
thus determines what shall be perceived and how it shall be 
interpreted is frequently referred to as bias or prejudice. 
In the case of the latter term, at least, the implication is that 
it is in some measure a weakness and a wicked thing, spring- 
ing from motives of self-interest or other selfish considera- 
tions. This attitude toward a perfectly natural phenomenon 
represents, of course, the survival of an antiquated psychol- 
ogy and of theology. While the practice of imputing ethical 
and “intellectual” motives to the reactions of the rest of the 
animal world has been abandoned for some time, it still per- 
sists in regard to man. Thus, to use an illustration from 
Raymond Pearl: “An amoeba behaves quite differently in 
going after his food according to whether this food is capable 
of running away, or is something like an algal cell that nat- 
urally stays quiet. In the former case the amoeba makes pro- 
vision in advance to prevent the prey’s escape ; in the latter 
it closes in at once without any preliminaries. When a man 
acts in this way, behavior is commonly called ‘intelligent’ or 
‘shrewd’.”* It should be added that such behavior on the 
part of a man is also “wicked” if the object of his approach 
is socially taboo. To the behaviorist, bias and prejudice, 
being merely the more or less permanent conditioning of our 
nervous system with respect to certain stimuli, is, from the 
scientific standpoint, as devoid of moral significance as the 
behavior of the ameeba. It is simply a normal property of 
protoplasm to have its future behavior tendencies condi- 
tioned and affected by its past behavior. Says Bernard: “All 
of us undoubtedly receive permanent organic and emotional 
and intellectual biases in our early years which we never fully 
overcome. Indeed it is not abnormal for us to do so.” 5 

Reduced to its simplest form then, bias may be said to be 
a conditioned response which predisposes an individual to 
make his observations fit a previously established system of 
evaluations and habituations. 


4 Raymond Pearl, “The Biology of Health,” Amer. Mercury, 6:386, 1925- 
5L. L. Bernard, Introduction to Social Psychology, (Holt, 1926), p. 219- 


OBJECTIVE OBSERVATIONS 47 


C. THE PREVALENCE OF BIAS AND PREJUDICE 
IN THE SOCIAL SCIENCES 


The fact that the perception and response tendencies called 
bias and prejudice in human beings are a natural and inevi- 
table condition, does not reduce its importance as a problem 
to be reckoned with in the employment of the senses in 
scientific observation. Bias and prejudice are complicating 
factors in all sciences, but their importance is much less in 
the physical than in the social sciences. The chief reason for 
this is that the subject matter of the physical sciences tends 
to be more removed from the common emotional complexes 
which we develop toward many social phenomena ; and hence 
the perceptions by normal senses of physical facts tend to be ` 
fairly uniform. As Keller has said: “A man can count the 
legs of a fly and report his findings without having his heart 
wrung because there are too many or too few.”’® But it is 
very difficult for the person who dearly loves his cocktail to 
induce himself to observe and report that prohibition seems 
to have been an economic boon to the community which he 
is studying. Likewise there is the example of the eminent 
historian who, on being confronted with various documen- 
tary facts tending to throw much doubt on his preconceived 
opinion as to responsibility for World War I, made this star- 
tling admission ` 


The subject is too involved, the underlying race and language 
antipathies are too strong, the confusion of relations in Eastern 
Europe too complex to make any review of printed i a 
safe basis for aah ly an opinion which was forged by the fires 


of war.” [Italics ours. 
D. INADEQUACY OF SUBJECTIVE JUDGMENTS 


While this tendency to observe and interpret data accord- 
ing to our prejudices is especially great in considering social 
phenomena — matters on which we have strong emotional at- 
titudes of love, hatred, reverence, etc.—it is by no means 

6 “Sociology and Science,” The Nation, 102:475, 1915. 
1 Albert Bushnell Hart, Current History, 20:196, 1924. Comment on an 


article by Harry Elmer Barnes on “Assessing the Blame for the World War,” 
ibid. 
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confined to the field of social relations. We have a certain 
acquired sense of the “fitness of things,” “reasonableness,” 
and “common sense” which frequently causes us to refuse to 
accept a conclusively demonstrated proposition even when 
the data involved are of purely academic interest and do not 
touch our lives in any way. For example, there is this fa- 
miliar hoax : 


Suppose an iron ring was placed around the earth at the 
equator, assuming the surface to be smooth so that a close fit 
could be obtained. Suppose the ring were cut in one place and 
seven additional feet were inserted into the circumference. How 
far would this cause the hoop to stand out from the earth ? 


When people are asked to give an impressionistic guess as 
to what effect this increase in circumference would have on 
distending the circle, the practically universal response is 
that the difference would be “imperceptible,” “enough to 
put a very thin paper under,” “one thousandth part of an 
inch,” etc. And yet it can be demonstrated by a process 
known to every schoolboy (the relationship between circum- 
ference and diameter of a circle) that the hoop in question 
would have its diameter increased by approximately two feet 
as a result of the addition of seven feet to its circumference, 
thus causing the hoop to stand out at least a foot from the 
surface of the earth all around. But this easily demonstrated 
conclusion is usually rejected as preposterous on the spur of 
the moment even by scientists and mathematicians. The 
only reason they can assign for refusing to accept the ines- 
capable conclusion is that it “doesn’t seem reasonable.” All 
that this means, of course, is that the results do not seem to 
conform with their sense experience within a realm very 


much more circumscribed and subject to direct and unaided 
sense impression. 


E. THE ““COMMON-SENSE” FALLACY 


Psychological and statistical treatises are full of examples 
of the enormous and ludicrous errors of our senses, unaided 
and uncorrected by objectifying devices outside of our or- 
ganic equipment. Nevertheless, the great majority of peo- 
ple, including most of the so-called educated, will without 
hesitation prefer to follow what they call their “reason” or 
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“common sense” when the conclusion reached through these 
much-revered faculties clashes with the findings of instru- 
mentally corrected sense impressions and logic. Herein lies 
one of the most important tests of the true scientist: has he 
the power coldly to set aside his preconceived subjective no- 
tions of the outcome of an experiment or investigation when 
the carefully checked objective data lead to other conclu- 
sions? It is perfectly desirable and defensible for him to use 
his subjective impressions and his empirical conclusions as a 
basis for skepticism of the conclusions reached by objective 
methods, especially when such skepticism leads him to repeat 
the experiment or to recheck his data. For this purpose, 
“common-sense” and “reasonableness” will always be the 
criterion by which results will be judged. “The mischief occurs 
when these subjective judgments are accepted as infallible, 
as substitutes, or as more reliable than objectively verifiable 
conclusions. The objective checks are, of course, not devices 
to be regarded as means in themselves of gaining knowl- 
edge of the facts. They are merely tools to assist and refine 
the senses when these organs are called upon to make observa- 
tions to which they are unadapted, unaccustomed, or under 
emotional or other handicaps. As a matter of fact, the valid- 
ity of a person’s “common-sense” itself depends on how ob- 
jectively that sense has been developed. It is only by virtue 
of objective external checks that our sense impressions have 
developed a validity even in the simpler adjustments of life, 
governed by “common-sense.” The misjudgments of dis- 
tance, size, and shape of a young child become corrected 
only by means of objective contact with the phenomena ob- 
served, 


F. THE DEVELOPMENT OF OBJECTIFYING DEVICES 


The recognition of this fallibility and inadequacy of the 
unaided senses has led to the invention of an impressive ar- 
ray of artificial devices for the objectification, standardiza- 
tion, and extension of crude sense impressions in the physical 
sciences. In fact, the development of these sciences has been 
very directly contingent upon developments in the tools 
whereby observations may be standardized. The require- 
ment that scientific facts must appear the same to all normal 
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minds has made necessary (1) the development of objective 
devices whereby the senses of different persons can be made 
to perceive alike, and (2) the development of a standardized 
system of symbols and terminology by which sense experi- 
ences could be accurately described and communicated un- 
ambiguously to others. The second of these developments 
will be treated in the following chapter. Let us consider 
here the nature of the devices which the physical sciences 
have invented to standardize observation. 


G. TYPES OF OBJECTIFYING DEVICES 


The paraphernalia invented for the purpose of improving 
our observation of phenomena, and thus rendering more 
adequate the first step in scientific method are of three gen- 
eral kinds: (1) The devices invented for the purpose of se- 
curing a more definite delimitation and measurement of 
units of observation. In this category fall the scales, calipers, 
rulers, graduates, etc., without which physical science, and 
the arts resting upon them, would be capable of only the 
crudest generalizations and manipulations. (2) The devices 
invented for the purpose of projecting the senses or increas- 
ing their power. In this category fall such instruments as 
the telescope, microscope, and the microphone. (3) Devices 
which serve both of the above purposes in that they both 
intensify the powers of the senses and measure the phenom- 
ena in standardized units. Such instruments are exemplified 
by the thermometer and the barometer. Upon the inven- 
tion and development of these devices for the projection of 
the senses and the standardization and objectification of 
sense impressions the advance of physical science has been 
directly dependent. 

The importance of these external objectifying devices in 
the development of physical science suggests that correspond- 
ing devices must also be invented to correct and standardize 
the observations of social phenomena, if the same degree of 
accuracy is to be achieved in the social sciences. In fact, 
such devices are of even greater importance in the social 
sciences because of the greater natural tendency, already 
mentioned, toward bias and prejudice. 
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What progress, then, has been made toward objectifying 
and standardizing the observation of social phenomena and 
what are the possibilities in this direction ? 


H. OBJECTIVE TRENDS IN THE SOCIAL SCIENCES 


While it must be admitted at the outset that there does 
not exist in social science any device of social observation or 
measurement comparable in nature or accuracy to the micro- 
scope or the calipers, the transition toward objective stand- 
ards is definite. In the development of social theory, for 
example, Bernard sees “a more or less constant movement 
toward an objective statement of social problems. Hobbes’ 
philosophy was a protest against the subjective and noumenal 
character of Scholasticism. Locke pointed out, in his own 
terminology, the relativity of subjective presentations and 
criteria. Bentham’s professed and real purpose was to ob- 
tain a constant and scientific experimental basis for regulat- 
ing morals and legislation, under which regulation everyone 
should be equal in privileges. .. His followers, John Stuart 
Mill, Spencer, the social or neo-utilitarian ethicists, and the 
early sociologists have constantly extended the analysis and 
have modified the criterion, till at last its hedonistic charac- 
ter is largely destroyed or disguised, though its subjectivism 
remains,”® Likewise the transition from a purely historical 
to a comparative method in history and anthropology is a 
trend toward objectivism. 

The tendency in recent times has been toward the induc- 
tive approach and the quantitative statement of social ob- 
servations. This trend has brought with it as an essential 
counterpart, some devices, as yet crude, for the objective ob- 
servation, measurement, and weighing (scoring) of social 
data. Up to the present these devices have taken the form 
of schedules, score cards, scales, and various other statistical 
devices. The greater part of the remainder of the book is 
devoted to the consideration of these all-important devices, 
upon the development of which largely depends the future 
of social science. 


$L. L. Bernard, The Transition to an Objective Standard of Social Control, 
(University of Chicago Press, 1911), pp- 88-89. 
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J. INTELLECTUAL HONESTY AND THE SCIENTIFIC METHOD 


Important as is the development of instruments and tech- 
niques of observation and measurement in science, it should 
not be assumed that either techniques or instruments are 
substitutes for an alert mind and common intellectual hon- 
esty. It sometimes happens that a scientist’s greed for ap- 
plause becomes greater than his devotion to truth. These 
are the strange cases in which a scientist will deliberately 
“doctor” his data in order to cause them to conform to a pre- 
conceived conclusion. This is plain cheating, and, when it 
occurs, it undermines science. Errors of this kind cannot be 
guarded against or corrected by instruments. The safeguard 
may be called an ethical one, and like all ethics, it is rela- 
tive to the end sought. That is, scientific training includes 
training in certain ideals (standards of accuracy, veracity; 
etc.) as necessary means to the end sought. From this point 
of view, there is no need of introducing “ethics” as a separate 
category, since it is merely an aspect of scientific competence, 
without which scientific ends are imperfectly achieved. The 
competent scientist will therefore avoid deliberate corrup- 
tion of his methods for the same reason that he avoids care- 
lessness at any stage of his work. Since a corruption of the 
data is incompatible with the end sought, training in scien- 
tific method and in scientific ideals (standards) is perhaps the 
only safeguard against errors of this kind. 

In those fields of observation in which objective measure- 
ment is not yet developed, familiarity with the psychological 
bases of prejudice is perhaps most helpful in enabling us 
to escape its warping influence. The person who is well ac- 
quainted with the etiology of his desires and emotions, the 
mechanics of their behavior, and their influence on his ob- 
servations, will take both objective and subjective precau- 
tions to correct for their influence. Thus Darwin urged that 
it is well to write down facts contradictory to a particular 
theory that we wish to test, for otherwise we are likely to 
forget them.® Judges will sometimes decline to sit on cases 
with which their connection has been such as to make a bias 


*W. F. Ogburn, “Bias, Psychoanalysis, and the Subjective in Relation to 
the Social Sciences,” Publ. Amer. Sociol. Soc., 17:66. spat. 
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probable. Sometimes a person's consciousness of a bias is 
so strong as to cause him to “lean over backwards” in trying 
to correct it. While scientific measurement is the only ade- 
quate method of correcting the influence of bias, a knowledge 
of the roles which desire, rationalization, fantastic thinking, 
etc., play in influencing our observations will help us to 
avoid the effects of some of our cruder prejudices.” 


J. THE NON-ETHICAL NATURE OF SCIENCE 


A more common and less conscious source of error is the 
tendency of scientists, and especially of social scientists, to 
permit the current code of morals of the community or their 
own personal notions of ethics to influence them in their 
collection and manipulation of data. The findings of science 
are per se non-ethical. It is not the business of a chemist 
who invents a high explosive to be influenced in his task as 
a scientist by considerations as to whether his product will be 
used to blow up cathedrals or to build tunnels through the 
mountains. Nor is it the business of the social scientist, in 
arriving at laws of group behavior, to permit himself to be 
influenced by considerations of how his conclusions coincide 
with existing notions, or what the effects will be of his find- 
ings on the social order. For his purpose as @ scientist, 
science must be an end in itself. As a human being, the 
scientist may properly look to the social results of his con- 
clusions. While he may not be in any better position than 
any other intelligent and informed person to say what society 
should aim at, he should be invaluable in providing the 
most efficient technique whereby whatever ends are sought 
may be reached. But asa scientist his only ethical responsi- 
bility lies in seeing that the rules of scientific procedure have 
been complied with. Here again the only safeguard is thor- 
ough training in scientific method. 

This does not mean that the social scientist may not as a 
scientist devote himself to questions of social welfare, how- 
ever defined, or to other questions involving an appraisal or 
an evaluation with reference to an end. It merely means 

10 For illustrations of the operation of such factors see L. L. Bernard, Intro- 


duction to Social Psychology, (Holt, 1926), ch, 13. See also John Dollard, 
Caste and Class in a Southern Town (Yale Univ. Press, 1937), ch. 3. 
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that he should be clear in his own mind, and make it clear 
to others, when he is dealing with “what is” and with “what 
ought to be.” Provided this is clear, the social scientist is 
not only justified, but is perhaps the only person qualified to 
specify what changes in existing conditions will bring about 
an elimination of conflicts or conditions which for any rea- 
son are regarded as obstructive to whatever end the commu- 
nity may (rightly or wrongly in the scientist’s opinion as a 
human being) regard as desirable. From this point of view, 
the scientist who is dealing with “what ought to be” is only 
engaged in the scientifically legitimate practice of prediction, 
based on observations of “what is.” He is merely predicting 
that a certain manipulation of factors in “what is” will result 
in something different. If he proceeds further and advocates 
the new product rather than the existing one, however, he is 
proceeding according to an ethical and not according to a sci- 
entific canon. There is no logic involved in science which re- 
quires him to take the last step. If he takes this step, he does 
so not as a scientist, but as a human being moved by an ethi- 
cal motive. 

There are those who feel that this interpretation of science 
reduces it to a cold technique devoid of human motivation. 
To them it might be pointed out that the pure scientist finds 
abundant motivation in the faith that only by a knowledge 
of “what is” can we make any practical programs of “what 
ought to be.” Without this foundation our utopias are futile 
because they may be impossible or undesirable. In short, 
the scientist has ample motive in his belief that only by a 
more perfect knowledge of the nature of his universe can 
man adjust himself intelligently to it.™ 


K. THE EXPERIMENTAL METHOD IN THE SOCIAL SCIENCES?” 


The development of objectifying devices, as well as their 
accurate employment on the phenomena to be studied in 


11 For a more thorough discussion of this subject and the problem of “val- 
ues” in the social sciences, see G. A. Lundberg, Future of the Social Sci- 
ences,” Sci. Monthly, (Oct. 1941, 346-359). Also Foundations of Sociology, 
PP. 29-31, 527-25, 533-34- 

12 The material in this and the following section is largely an adaptation of 
Professor F. Stuart Chapin’s pioneer contribution to this subject in his “The 
buon gop Method in Sociology.” Sci. Monthly, 4:298-47, 1917. The quo- 
tations and most of the illustrations and analysis are drawn from this source, 
by permission. 
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the physical sciences, has been accomplished for the most 
part in the laboratory by the experimental method. Among 
the great advantages of this method are, first, that it permits 
the repetition of observations under practically identical 
conditions. This facilitates the verification of the observa- 
tions by many observers. Second, it enables the observer to 
vary only one condition at a time, and to maintain all other 
conditions constant. This allows us to analyze the relations 
of cause and effect with much greater speed and certainty 
than is possible under uncontrolled conditions. For, if two 
conditions are varied at one time, it is not possible to tell 
which condition produced whatever effect occurred, or 
whether both acted jointly, or whether they neutralized 
each other. But under experimental conditions this diffi- 
culty is obviated by artificially controlling the variables. Be- 
cause of these tremendous advantages of the experimental 
method, the question of its applicability in the social sciences 
becomes one of major importance. If this method is not ap- 
plicable to social phenomena, it seems reasonable that the 
social sciences will be greatly handicapped, if not entirely 
thwarted, in their maturation. 

The social scientist will probably never be able to bring 
into his laboratory a piece of society, there to observe its be- 
havior in a test tube under varied conditions. But neither 
have astronomers ever brought the solar system into any 
laboratory. Their science has prospered, nevertheless. It is 
true that astronomical observatories contain very ingenious 
symbolic and mechanical representations of the astronomical 
aspects of the solar system and remarkable instruments for 
observing it. These, every science, and especially the social 
sciences, must develop. The experimental and laboratory 
techniques must in any case be adapted to the subject mat- 
ter in which we are interested, and need not have much in 
common with those of physics and chemistry. 

The particular technique of the experimental method 
varies widely in different fields. In biology, for example, we 
find it consists of the exposure of animals, as individuals and 
as groups, to varied environmental conditions and of the ob- 
servation of their behavior. In psychology, the experimental 
method has been developed and applied to a great extent to 
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the behavior of human individuals, including their social 
behavior. The difficulties involved in such experimentation 
on large social groups are apparently formidable. But since 
the technique of experimentation must vary with the nature 
of the subject matter, and since very little attention has yet 
been given to the development of such a technique in the 
social sciences, the assumption that this method is not ap: 
plicable to social phenomena is hardly warranted. 

In fact, the use of the experimental method in various 
forms in the social studies is by no means unknown or even 
new. It has been extensively employed in the past and is 
being increasingly employed at present. Thus, the social 
studies have always availed themselves of what Chapin has 
called “natural experimentation.” This consists of observ- 
ing social behavior where nature has limited or held con- 
stant certain physical factors. The social facts observed in 
such areas are compared with those observed in other regions 
where these physical factors are or are not present. Of 
course, if we define experimentation as taking place only 
when there is actual human interference with the determin- 
ing conditions, this natural experimentation is not experi- 
mentation at all. In any case, it is a very crude method be- 
cause of the presence of many other variables than the gen- 
eral geographic and physical differences. But even with these 
limitations, the method has been of tremendous value to 
anthropologists and sociologists. For example, the isolation 
from the rest of mankind, the lack of variety of flora and 
fauna, and the climate have been taken to explain many of 
the customs and modes of living of the Eskimo, such as patri- 
cide (owing to the scarcity of food), the elementary organiza- 
tion of property rights, and absence of commercialism. Like- 
wise, the survival of colonial customs in the Appalachian 
highlands of the south, while the rest of the country has been 
revolutionized, constitutes a natural experiment in the ef- 
fects of isolation. While such studies have great suggestive 

value, they are not true experiments in that the conditions 
are not subject to the manipulation of the observer. As a 
result, there are too many varied factors present. Further- 
more, as the number of similar cases available for observa- 
tion are few, and since they are difficult to observe properly, 
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this must be regarded as only the crudest approximation to 
an experimental study. 


L. THE UTOPIAN COMMUNITY, “EXPERIMENTS” 


Perhaps the best examples of avowed social experimenta 
tion are to be found in the Utopian Community experiments 
conducted by the associationists of the last century. In the 
establishment of such communities as New Harmony (1825) 
and Brook Farm (1841—47) the attempt was deliberately made 
to eliminate certain competitive conditions obtaining in soci- 
ety at large in order to achieve personal freedom, congeniality 
among people of minority opinions and even economic pros- 
perity. Robert Owen, believing that the evils of the capital- 
istic system resulted from private property, orthodox religion, 
and the institution of marriage, established the commu- 
nity of New Harmony for the avowed purpose of testing 
his hypothesis by omitting these three institutions in their 
usual meaning. The community soon failed and broke up. 
On the assumption that these three fundamental human in- 
stitutions were actually eliminated or reduced to constant 
elements, we have experimental proof of the instability of 
society without them. But unfortunately, the superficiality 
of the experiment allowed the presence of a number of other 
variables, each of which might, and, in fact, probably does, 
explain the break-up of the community. Thus, there was 
the heterogeneity of the population and a resulting “absence 
of like-mindedness,” owing to the imperfect control of the 
experiment itself, Furthermore, there was a lack of isola- 
tion from the disturbing influences of outside society, as well 
as the varying social heritage and different value systems 
which the members themselves brought with them. People 
born and bred under other social conditions could not, of 
course, at once develop a new set of habits of thought and 
conduct. This inability to eliminate the prejudices, customs, 
and standards of outside society at once destroys the validity 
of the experiment. As Chapin has said, “It was like attempt- 
ing to conduct a chemical experiment in a bowl of molasses.” 

The somewhat similar experiments of Fourier, while more 
successful than the New Harmony attempt, also failed to es- 
tablish themselves on a permanent basis. But here again, 
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the social standards of outside society, to which members of 
the experimental communities were still susceptible, make 
it impossible to draw any conclusions as to the social value 
of the innovations tried in the communities themselves. ‘The 
conclusion that it was the lack of isolation from the sur- 
rounding society rather than the innovations per se of the 
experimental communities which destroyed them is strength- 
ened by the fact that where greater isolation has been se- 
cured, the experiments have sometimes succeeded. ‘Thus, 
the Mormons of Utah inaugurated a social order in many 
respects unique, and flourished under it for many years. It 
is true that a new religious revelation was the essential con- 
dition in the success of this experiment, as in practically all 
others that may be regarded as successful. The religious fac- 
tor is important in that it provides an accepted authority, 
thus minimizing dissension. 

These experiments suggest that perhaps valid social experi- 
mentation is difficult because experiments must always take 
place in a social medium, the complicating effects of which 
we do not yet, at least, know how to calculate and allow for. 
The only way to avoid the complicating effects of the larger 
society surrounding a small community where an experiment 
is being tried, would appear to be to attempt the experiment 
on the larger society itself, that is, on the state or the nation. 
This, again, is by no means an unknown form of experimen- 
tation, as we shall see below. Sovereign states, through legis- 
lation, have long been engaged in experiments of this type. 


M. SOCIAL LEGISLATION AS SOCIAL EXPERIMENTATION 


Although legislation is not usually enacted for the purpose 
or in the spirit of experimentation, it is undoubtedly the 
most important type of social experimentation on a large 
scale in existence today. Legislation or enactments such as 
the Prohibition Amendment, the Woman's Suffrage Amend- 
ment, the government operation of railways during the first 
World War, the ventures into government ownership and 
insurance and much social legislation enacted under the 
“New Deal” 1933-36, are clearly modifications and innova- 
tions of an experimental nature. Overshadowing in scope 
and importance all such experiments, ancient and modern, 
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is perhaps the Soviet regime in Russia. The results of all 
such experiments, if accurately measured and reported, would 
clearly be of the greatest importance to social science. 

The difficulty with this form of “experimentation” is that 
it is almost entirely of the trial-and-error type. Much impor- 
tant legislation is enacted every year and important social 
changes take place every year. But whether the legislation 
is causally related to the changes in the department directly 
aimed at by the legislation we have no definite method of 
determining. When the direct effects of the legislation are 
reasonably clear, these effects may themselves be the causes 
of other problems in other departments of society. Thus, a 
stringent divorce law may effect its immediate purpose, 
namely to cut down the number of divorces. But it may 
also cause an increase in illicit relations and illegitimacy. 
The trial-and-error process of legislation with individual un- 
related statutes instead of scientific programs of legislation 
anticipating the effects on all other fields, is, like all trial- 
and-error experimentation, a blundering and wasteful proc- 
ess. But it is perhaps the only method possible in a new 
field, where the knowledge of the significant factors and their 
interrelationship is very limited. 

Here we are confronted again with the principal difficulty 
of experimentation in the social sciences—the difficulty of 
controlling or calculating all the variables. This is due to 
two chief causes: (1) the complexity of the problem and (2) 
the undeveloped state of our methods of social measurement. 
The first cause has already been shown to flow from our 1g- 
norance of the factors involved. The difference between the 
trial-and-error method prevalent in social experimentation 
today (prevalent in all sciences in their early stages), and the 
highly controlled experiments of the physical sciences is that 
in the latter case all the important conditions are known. 
Furthermore, very exact methods of measuring these condi- 
tions have been developed. Hence their influences on the 
gross results of the experiment can be allowed for. The re- 
sult is that the outcome of such experiments can be attrib- 
uted with comparative certainty to a single variable. | Fre- 
quently this result is attained by using “control” individuals 
or groups. Thus if a biologist desires to determine the effect 
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of a certain food on the physiology of rats, he selects two 
groups of rats as much alike as possible and maintains them 
as nearly as possible under the same conditions. One group 
is fed the particular food the effects of which the biologist is 
studying. From the other group this food is withheld. 
Careful measurement of the physiological conditions of the 
rats of the two groups reveal what differences, if any, are to 
be observed in their size, weight, or other conditions. The 
differences are then properly assumed to result from the fac- 
tor present in the one case, but absent in the other. 


N. SCIENTIFIC SOCIAL EXPERIMENTATION 


The importance of accurate methods of measurement in 
social experimentation is further emphasized by the fact that 
perhaps our most successful social experiments have been in 
the field of public health. The fact that such objective meas- 
ures as death rates are available in this field, and the fact that 
much is known regarding the conditions causing some dis- 
eases, enables us to measure the results of health programs 
with considerable accuracy. An excellent example of this 
technique is found in the Framingham Community Health 
and Tuberculosis Demonstration. The Metropolitan Life 
Insurance Company in 1916 placed $100,000 at the disposal 
of the National Association for the Study and Prevention of 
Tuberculosis to conduct an intensive experiment to deter- 
mine whether it is possible substantially to reduce the mor- 
bidity and mortality of tuberculosis. First, a thorough study 
of existing conditions both of the people and their environ- 
‘ment was undertaken. This included a study of the accuracy 
of the reporting of vital statistics, a study of general sanitary 
conditions, milk production and handling, a tuberculosis 
survey of cattle, food sanitation in stores, school hygiene, in- 
dustrial hygiene, etc. The people themselves were studied 
with reference to the prevalence of physical defects, social 
and economic status and other objective measures of the ex- 
isting conditions. All this material was carefully recorded 
in such form as to make possible rigid comparisons after the 
health program had been put in force. A comprehensive 
health campaign was then inaugurated. At the end of the 
first five-year period of the experiment the city showed a re- 
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duction of sixty-seven percent!’ of the tuberculosis mor- 
tality rate. The experiment is notable for the care with 
which the records were kept throughout the period and the 
allowance for possible variations in other conditions at’ the 
beginning and at the end of the period. 

Foremost as an example of meticulously careful experi- 
mentation on entire communities is $. C. Dodd's study of the 
effect of education in hygienic practices in rural ‘villages in 
Syria.* Especially worthy of note in this case is the care 
with which measuring instruments were perfected and the 
sample villages chosen, before the experiment was, under- 
taken. Dodd divided the villages into two samples for ex- 
perimental purposes and carefully measured, with specially 
prepared and tested instruments, the hygienic status and 
practices of each. One sample was then subjected to a two- 
year period of hygienic education through an itinerant clinic. 
‘At the end of this period the hygienic status of the two sam- 
ples were again carefully measured. The comparison of 
the scores of the experimental and the control villages yielded 
a reliable measure of the results of the program of health 
education. It was found that (a) all of the villages improved 
significantly in their hygienic practices and conditions dur- 
ing the period considered, and that (b) the experimental vil- 
lage improved more, though the amount of the improvement 
was not statistically significant, probably because of (c) the 
difficulty of isolating entirely the control villages from the 
influence of the program of health education conducted in 
the experimental village. Nevertheless, the study is notable, 
as a model of scientific proficiency in the care with which the 
measuring instruments were prepared, tested, and applied, 
as well as in the attention to problems of sampling and prob- 
able errors. j 

As an example of more restricted laboratory experimenta- 


13 National Association for the Study and Prevention of Tuberculosis, 
“Framingham Community Health and Tuberculosis Demonstration,” Framing- 
ham Monograph, No. 9, 1922, P- 4- 

For other notable examples of social experimentation, see Edgar Syden- 
stricker, “Statistical Evaluation of the Results of Social Experiments,” Jour. 
Amer. Statis. Asso., 23: (N.S.), 1928. Supplement, (Proceedings of the Eighty- 
ninth Annual Meeting), pp. 155-65- 

14 See the reference to this model study in the “Suggestions for Further 
Study” at the end of Chapter I. 
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tion on carefully controlled social groups, there is Lippitt’s 
study of the effect of democratic and authoritarian group 
atmosphere upon the behavior of two children’s clubs.1° The 
clubs were formed after careful consideration of their com- 
parability with regard to individual characteristics of their 
numbers, cultural homogeneity, position and relationship of 
members to each other, and to the larger social field, such as 
the class in school. In one of these groups the leader encour- 
aged a maximum amount of choice and decision after discus- 
sion by the members on all matters. In the other club, the 
leader made these decisions for the most part and, in gen- 
eral, ordered the pupils about in authoritarian fashion. Ex- 
tensive stenographic, descriptive “total behavior,” and sta- 
tistical records of the ensuing behavior were kept.° These 
results are rich in hypotheses as to the effect of different 
kinds of social “climate” upon social behavior and interper- 
sonal relations. The study is notable for its attention to 
detail in setting up the experiment and for the variety and 
ingenuity of its methods of analysis. The sociometric and 
topological methods used in the latter connection are of spe- 
cial interest. These will be discussed in a later chapter in 
connection with the work of Moreno and others in this field. 
Moreno’s numerous experiments on the social effects of 
changing the personnel of groups in the Hudson State Train- 
ing School, as well as his experiments in psychodramatic 
work, are valuable as examples of the observation of social 
behavior under controlled conditions.17 

The most careful and comprehensive experimental studies 
of the effect of varying conditions upon group behavior is to 
be found perhaps in the literature of industrial psychology. 

15 R. Lippitt, “An Experimental Study of the Effect of Democratic and Au- 
thoritarian Group Atmosphere,” Studies in Topological and Vector Psychol- 
ogy, I (Univ. of Towa, 1940). See also, R. Lippitt and R. K. White, “The 


Effect of Laissez-faire, Democratic, and Autocratic Social Climates.” (Un- 
published study.) 

16 In subsequent experiments moving picture records were also used. See 
R. Lippitt and R. K. White, op. cit, (unpublished study, Univ. of Towa). 
ao in K. Lewin, et al., Studies in Topological and Vector Psychology, 
I niv, of Iowa, 1940), p. 107. 

1J. L. Moreno, Who Shall Survive? A New Approach to the Problem of 
Human Interrelations (Nervous and Mental Disease Publishing Co., Washing- 
ton, D.C., 1934). See also the journal Sociometry, which is largely devoted to 
experimental work in the field of interpersonal relations. 
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Mayo,!8 Whitehead, and Roethlisberger and Dickson, for 
example, report a large number of studies of experimental 
groups under carefully controlled conditions in a large in- 
dustrial plant. While these studies are primarily concerned 
with the productivity of the worker, they throw a flood of 
light on the conditions governing, and the results of, per- 
sonal relationships, incentives, freedom, self-direction, and 
morale! Important as an example of refined laboratory 
experimentation also is the work of Chapple and Arensberg 
in the measurement of social interaction of human beings.” 


O. OTHER EXPERIMENTATION AND REFINED OBSERVATION 


As a rule the term “experiment” is, as we have seen, used 
loosely in the social sciences to indicate any innovation or 
carefully planned observation. The larger of these under- 
takings, such as the so-called experimental schools and col- 
leges, have erred as scientific experiments by failing to keep 
sufficiently objective records of the conditions and the re- 
sults of their educational innovations. That is, it has been 
for the most part impossible to attribute any variation in 
their results directly to the experimental conditions because 
of failure to control adequately or allow for other factors. 
More successful in these respects because of the more de- 
tailed observation and recording of actual behavior have 
been the laboratory observation of nursery school children 
with respect to personality traits and elementary social be- 
havior. The following examples from the Institute of Child 
Welfare Research of Teachers College are illustrative of the 
type of controlled observation now being applied to human 


groups: 


Floor plans were drawn of the roof and indoor playrooms where 
the nursery school children spend the first two hours of the day 


18 Elton Mayo, The Human Problems of an Industrial Civilization (Mac- 


millan, 1938). 3 
19 T. N. Whitehead, Leadership in a Free Society (Harvard Univ. Press, 


1936). 
one. J. Roethlisberger and W. M. Dickson, Management and the Worker 
(Harvard Univ. Press, 1939). i 
21 For experimental attempts to measure morale, see L. D. Zeleny, “Sociom- 
etry of Morale,” Amer. Sociol. Rev., 4:799-808, 1939. A 
22E, D. Chapple with the collaboration of C. M. Arensberg, “Measurin; 
Human Relations: An Introduction to the Study of Interaction of Individ- 
uals,” Genetic Psych. Monographs, 22:3-147, 1940- 
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‘in relatively “free” (in the sense of minimally directed) play 
activities. In all these rooms, there is a variety of objects, some 
of which are large and stationary, e.g., jungle-gym, swings, sand: 
pile, piano, platform and steps, etc. ; others may change from day 
to day, but remain stationary during any particular day, e.g., 
rabbit cage, box on which nails are hammered, slide, etc. ; others 
are carried about by the children at will, e.g., blocks, doll-car- 
riages, kiddie-kars, and innumerable small toys. The larger, im- 
movable objects are indicated on the floor plans. Records each 
week have been obtained of the activities of each child of the 
lower age group by tracing his progress on these floor plans over 
a five-minute period. Every stop that he made was indicated, 
and timed, in seconds, with a stop-watch. His approach to and 
contact with other persons, adults or children, was ‘noted by ap- 
propriate symbols, as likewise the contact or approach of other 
people towards him. Spontaneous activity was indicated, as was 
also “directed” activity. The time any particular object was in 
his possession was shown by indicating when he picked it up, and 
when he dropped it. . . 

Another study on social behavior being made is closely related 
to the study by Miss Parten at Minneapolis. The technique here 
calls for the recording of every social grouping occurring during 
a given period, i.e., every time two or more children are together. 

e time each child comes to and leaves the group is indicated. 
If possible, the “function” or part played by each child in the 
group is also indicated briefly, and a check is placed by the child 
who has initiated the group, if this is obvious. Probably the only 
data available for statistical purposes are those indicating the time 
spent by each child in a group, which will make possible the com- 
putation of an index of group participation for each child. 
_ On the theory that laughter has a large element of the social 
in it, a study has also been made of the laughter situations in the 
nursery school. On a specially prepared blank, every laughter 
situation is recorded, with the names of the children in the group 
where the laughter occurred, and checks indicating which of the 
children “exposed” responded by laughing or smiling. Because 
of the interest inherent in this study of laughter, it is being made 
on three separate age groups. Thus, in addition to a laughter 
index (percentage of laughter to exposure) for each of the nursery 
school children, which will be used in comparison with the other 
social data, there is a genetic study of laughter in young children. 

Several thousand situations have been recorded, and these have 
been analyzed in their relation to chronological age, mental age, 
and other variables. 


"W. I. and D. S. Thomas, The Child in America, (Knopf, 1928), pp- 522° 
24. For more complete accounts of this kind of laboratory observation, see 
D. 5. Thomas, A. M. Loomis, and R. Arrington, Observational Studies of So- 
cial Behavior, I, (Institute of Human Relations, Yale University, 1933). 


“OBJECTIVE OBSERVATIONS 65 


These methods are largely dependent for their success on 
the development of tools and techniques of objective obser- 
vation, recording, and measurement. The motion-picture 
camera with sound recording has vast possibilities, as yet 
largely unexploited, in this field. By means of these devices 
it is now possible to observe and record intricate social situa- 
tions for subsequent repetition and detailed scientific study 
in the laboratory. 


P. SOCIAL EXPERIMENTATION AND THE STATISTICAL METHOD 


Outside of the laboratory, the method of controls is rarely 
applicable. * In social experiments especially, it is usually 
necessary to experiment on a single group, and to draw our 
conclusions from a comparison of the significant facts before 
and after a new factor is introduced. This gives rise to seri- 
ous complications owing to the lack of control of important 
conditions during the whole period of the experiment. But 
if we cannot inject or eliminate external conditions which 
may influence the outcome of an experiment, may not the 
same result be attained by measuring the effect of these in- 
fluences? In this way we can allow for the influence of each 
variable in the gross results, and conclude that any residuary 
difference in social conditions before and after a new factor 
was introduced must be due to this new factor. If we keep 
a careful record of all significant social conditions, for exam- 
ple, before a law is enacted, and keep strictly comparable 
records after its enactment, may we not draw conclusions of 
some validity regarding the results of the experiment? If 
we cannot take into consideration and make allowance for 
all the conditions that influence an event over a period of 
time, which is probably impossible, we may be able to get a 
“first approximation” to the truth. : et 

One of the commonest forms of such “experimentation 
in the academic field is to give tests before a course is taken 
and to give similar tests at the end of the course.” It is as- 
sumed, although no rigorous attempt is made to allow for 
other possible influences, that the change in information 
and attitude, if any, results from the course to which the stu- 


24 See, for example, S. Menefee, “T eaching Sociology and Student Attitudes,” 
Sociol. & Soc. Research, 22:545-56, 1933: 
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dent. has been exposed. Menefee has also used this method 
in determining the influence of stereotypes upon people’s 
expressions of attitude.** Statements of definite “class” or 
“party” significance were submitted to comparable groups 
and to the same groups after an interval of time. The state- 
ments were first submitted without any characterization such 
as “conservative,” “radical,” “Communist,” “Fascist,” etc., 
and were then submitted with these labels attached to the 
statements. The resulting changes in the degree to which 
different groups accepted or rejected the statements indi- 
cated the degree to which these groups were influenced in 
their attitude toward an issue by the presence or absence of 
a stereotyped label. Menefee used a somewhat similar tech- 
nique in studying the effect of emotional as compared with 
relatively non-emotional propaganda.*° 

A somewhat different and more highly controlled type of 
experiment to determine people’s attitudes and the factors 
that influence them is represented by such studies as that of 
Horowitz," who studied the development of attitudes in 
children toward the Negro by subjecting them to photo- 
graphs of play groups, eating groups, etc., some of which had 
Negroes present and others not. Choices by a child of the 
groups he preferred to join indicated the presence or absence 
of prejudice against the Negro. 

A very large literature of this general character reporting 
studies of changing attitudes under more or less controlled 
conditions has already appeared. These studies will prob- 
ably increase and improve in quality. 

Studies of the type described above, as well as those of a 
controlled laboratory sort, are dependent upon the use of 
statistical methods in recording and interpreting the find- 
ings. We are interested in observing the differences in be- 
havior under different circumstances. But when are ob- 


255, Menefee, “The Effect of Stereotyped Words on Political Judgments,” 
Amer. Sociol. Rev., 1:614-21, 1996. 

2S, Menefee, “Propaganda and Opinions on Foreign Policy,” Jour. of Soc: 

Ch., 11:393-404, 1940. See also the intensive study (forthcoming) of the 

ect of a presidential campaign in Erie County, Ohio, 1940, by Paul Lazars 
feld. See also p. 365 below, note 12, $ 

E L. Horowitz, “The Development of Attitude Towards the Negro, 
Archives for 4 03 No. 194, 1996. See Chap. VIII of the present text for 
2 summary of Horowitz's methods. 
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served differences attributable to chance, and when are, they 
almost certainly due to the conditions of the experiment? 
These are crucial questions in any experiment, and they can 
be answered only on the basis of the known laws of sampling, 
probability, and error. These are subjects which occupy a 
central place in any treatise on statistical methods, and al- 
though we shall make some references to those subjects in 
later chapters, a full discussion of the statistical methods in- 
dispensable to scientific experimentation in any field must 
be left to the treatises on that subject. The usefulness of 
simple statistical analysis in what Chapin ** has called ex post 
facto experimental studies should, however, receive some at- 
tention in the present connection. The approach is described 
by Chapin in part as follows: 

“From diverse experiments with the experimental method 
in education, psychology, and sociology, a pattern of practi- 
cable procedure has begun to emerge. It is our opinion that 
this pattern of procedure supplies the outlines of a long de- 
sired design for social experiments. To avoid misunder- 
standing of terms, let us state at the outset that by experi- 
ments we do not mean such trial and error efforts as the 
NRA, the AAA, the TVA, or similar gigantic social-economic 
reforms which are without controlled variables or adequate 
devices to measure their effects. We mean by experiment 
the observation of the changing relationship in an interval 
of time between two variables, meanwhile holding constant 
or controlling several other variables, which if uncontrolled 
might themselves explain or cause the effects. Suffice it to 
say that we shall attempt to describe briefly the efforts made 
to adapt the experimental method of physical science to the 
study of cause and effect or functional relationships in the 
social field. . . 

“This experiment? was based upon the high school rec- 

%F, S. Chapin, “A Study of Social Adjustment Using the Technique of 
Analysis by Selective Control,” Soc. Forces, 18:487, 1940. See also, b the 


same author, “Design for Social Experiments,” Amer. Sociol. Rev., 3:786-800, 
1938. : 
5 Helen F, Christiansen, “The Relation of School Progress, Measured in 
Terms of the Total Amount of School Attendance or Course Completion, to 
Su uent Economic Adjustment.” M.A. thesis, June, 1928, Univ. of Minne- 
sota Library. See also a similar study by N. G. Mandel, “A Controlled Analy- 
sis of the Relationship of Boy Scout Tenure and Participation to Community 
Adjustment,” M.A. thesis, Univ. of Minnesota Library, July, 1938. 
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ords and community experiences of 2127 boys and girls who 
left four St. Paul high schools in the school year of 1926, as 
graduates, or after having completed from one to three years 
of their high school course. There were only four St. Paul 
high schools at this date so that the experiment concérns the 
whole universe and not a sample. The year 1926 was taken 
because it was the earliest year for which comparable records 
on a large number of students were available. Moreover, 
since the follow-up was to the year 1935, there was thus a 
period of nine years in which these individuals could work 
out economic adjustments. 

“The working hypothesis of this study was: a greater de- _ 
gree of progress in high school leads to a correspondingly 
higher degree of economic adjustment in the community. 
Thus, the two variable factors whose relationship is to be 
measured are ones that lie at the heart of educational policy. 
Do we not spend millions of dollars annually to support 
high school education on the assumption that this hypoth- 
esis is a true statement of social relationship? Of course, 
there are other purposes of high school education besides 
economic adjustment, but certainly economic adjustment is 
one of the most practical considerations. 

“Having set up the working hypothesis of the experiment, 
it next becomes necessary to secure measures of the two vari- 
ables to be observed. The independent variable, school 
progress, was measured by the number of years of the high 
school course completed when the student left school in 
1926. Of the total 2127 boys and girls, 1130 graduated from 
high school in 1926 after completing four years and 997 

„dropped out in 1926 after having been in high school for 
the regular one or two or three years of the course. The 
measure of economic adjustment selected for the dependent 
variable was the percentage of shifts in jobs from 1926 to 
1935 that involved no change in salary or an increase in sal- 
ary as contrasted to the percentage of shifts that involved 
decrease in salary. 

“Now it is perfectly obvious that these are extremely crude 
measures. Factors of age difference as between those who left 
at the end of the freshman high school year and those who 
remained to graduate might affect economic adjustment. Sex 
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differences are often significant. Boys or girls from homes of 
higher status would have an advantage in gaining and hold- 
ing employment ’not possessed by children from homes of 
lower status. Differences in the nationality of the parents 
would influence the chances of getting a job. The neighbor- 
hood of the home from which the boy or girl came might be a 
factor in economic adjustment. The intelligence or mental 
ability of the different individuals would exert its influence 
upon securing a job, holding the job, and upon promotion in 
rank and salary on the job. This network of factors is also 
one of the interrelationship. An American boy whose father 
was a successful professional man living in a restricted neigh- 
borlidod would have several factors combined in his favor in 
comparison with the Jewish boy whose father was a clerk and 
lived in a deteriorated neighborhood. Since every one of 
these variable factors are recognized by sensible people as in- 
fluencing the course of individual economic progress, the way 
to obviate their disturbing influence is to control them. Here, 
therefore, we come again to our crucial problem of controls. . . 

“Each of these six factors, chronological age, sex, nation- 
ality of parents, father’s occupation, neighborhood status, 
and mental ability (by computing high school marks because 
the measurement by I.Q. was not available) was controlled. . . 

“It took a full year of systematic work in home visits and 
interviewing to trace the 1130 graduates of 1926, and the 997 
drop-outs of 1926, to their status of 1935. In this process, 
there was a shrinkage of 933 individuals in the total. Of this 
number lost, 21 were deceased, 42 had moved out of town, 
575 could not be traced in the follow-up, and 295 had records 
so incomplete as to make comparison worthless. Thus, of the » 
original 2127, there were located a group of 671 graduates 
and a group of 523 drop-outs. . . 

“The process of gaining control began with the selection 
from the control group of a child who was then matched with 
another child from the experimental group for sex and na- 
tionality of parents. This reduced the two groups to smaller 
groups with identical proportions in sex division and in the 
distribution of parental nationality. At this point the con- 
trol of factors by identity through individual matching had 
to be supplanted by control through the correspondence of 
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frequency distributions on each factor. The reason for this 
change was that the conditions of individual identity on a 
factor by matching eliminated so many cases that the sample 
dwindled in size at an alarming rate after each new control 
was set. ? 

“The correspondence of frequency distributions on a given 
variable factor is a far less rigorous control of this variable 
factor than is identity by matching individual with individ- 
ual, but it is probably the most frequently used method of 
control in contemporary experimental studies. Its selection 
and application on grounds of reducing the attenuation of 
the sample is justified when the results of the experiment are 
significant. When the results are not significant and no other 
explanation is reasonable, then it is necessary to resort to the 
more expensive process of individual matching, but this 
means repetition of the experiment, or at least increasing the 
number of cases observed. 

“Setting the six controls reduced the final sample to a total 
of only 290 cases, 145 in the control group and 145 in the 
experimental group, a decline of 86.4 *percent from the 
original group of 2127 students! This is the price of ob- 
servation under conditions of control. The longer the list 
of controls and the more vigorous their method of applica- 
tion, the smaller the final sample. At this point, some statis- 
ticians may say that we end up with a sample too small to be 
representative of conditions in a large original group, but let 
me remind them that an experiment is designed to obtain a 
homogeneous and ‘pure’ sample. In fact, our sample has 
been purged of the very factors that made for heterogeneity 
in the original group and whose presence obscured the real 
relationship between the factors we set out to study. To dis- 
cover the real relationship between a magnet and iron, we 
must have ‘pure’ iron and not iron ore that is complicated 
by the presence of other minerals and metals, which it would 
be if ‘representative’ of the original ore. Homogeneity, not 
representativeness, is the essential condition to the discovery 
by experiment of a real relationship between two factors: 
Consequently, if the present experiment shows that there is a 
relationship between the amount of high school education 
and the degree of subsequent economic adjustment, this re- 
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lationship is more likely to be a real relationship than is the 
case under conditions in which the social situation is compli- 
cated by uncontfolled factors. Within certain limits (the 
limits of the given experiment), the homogeneity of the two 
sub-samples is more important than their representativeness 
of variable factors that originally obscured the relationship. 

“Finally, if we turn now to the differences in economic 
adjustment of the control group of drop-out students and the 
experimental group of graduates we find that 88.7 percent of 
the graduates experienced no changes in salary or had in- 
creases in salary from 1926 to 1935, whereas 83.4 percent of 
the drop-outs reported increases or no changes in salary from 
1926 to 1935. Putting it the other way, only 11.3 percent of 
the graduates suffered salary decreases in this period, whereas 
16.6 percent of the drop-outs suffered salary decreases. This 
is a small difference of 5.3 percent. 

“When the length of high school education before drop-out 
is analyzed, we find that 74.1 percent who left school in 1926 
at the end of one year of high school had salary increases or 
no changes in salary during the period 1926-1935; and of 
those who ended two years of high school, 85.1 percent were 
adjusted economically ; and 89.6 percent of those who ended 
three years of high school were adjusted. Thus, in general, 
the longer the period of high school education, the higher the 
percentage of adjustment in the economic terms used as a 
criterion. None of these differences are statistically signifi- 
cant as single differences. The important point is that they 
are consistent and in the same direction.*° It is a matter of 
opinion whether small differences that are corroboratory and 
in the same direction are as important as differences that are 
large and statistically significant. Our opinion is unequivo- 
cally that small differences in the same direction may be as 
important as one large difference that is statistically signifi- 
cant. There are two reasons for this opinion. First, social 
phenomena are complex and not likely to show large differ- 
ences because of the configurational character of the social 
situation. The separate factors in a social situation are usu- 

30 For an important discussion of the statistical aspects of this point, see 


C. C. Peters, “Note on a Misconception of Statistical Significance,” Amer. 
Jour. Sociol., 38:231-36, 1933- 
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ally functionally related. Second, the conventional tests of 
the significance of sampling are based upon the theory of 
random samples, and in experimental wořk, as we have at- 
tempted to show, it is the terminal homogeneity and purity 
in the sample, rather than initial representativeness of hetero- 
geneity, that is important in demonstrating the real relation- 
ship between two variable factors. 

_ “Thus [this] experiment, in spite of the crudity of meas- 
uring devices used, was nevertheless a successful experiment 
if the argument in favor of corroboratory small differ- 
ences is accepted, because these small differences in salary 
changes, percentage voting, average number of interests in 
activities, average number of additional years education, 
and occupational class attained, all support the hypothesis by 
showing evidence of better adjustment with every increase in 
the amount of high school education. Doubtless there will 
be those who will observe that the experiment has merely 
proved that which we expected to be true. But justification 
by expectation and wishful thinking is quite a different thing 
from corroboration by factual evidence. On the other hand, 
there may be critical persons who will observe that the re- 
sults of this experiment are almost too good to be true. To 
all such, we may reply that the evidence is available for criti- 
cal appraisal in the thesis, and that in any event, we do not 
claim one successful experiment proves a relationship to be 
true. The experiment needs to be repeated and the results 
verified before the principle is established on factual evidence 
rather than on wishful opinion, . . 

“In this experiment, we have found that control of such 
variable factors as age, sex, nationality of parents, occupa- 
tion of father, neighborhood status, and high school marks, 
is sufficiently adequate control of the social situation to lead 
to definite scientific results. What would have happened if 
we had exercised a more rigorous control by the device 
identity secured through individual pairing for every vari 
able factor instead of relying on the rough control of corre- 
spondence between distributions in the case of the four fac- 
tors of age, father’s occupation, neighborhood status and 
high school marks? .. . 

“The answer to this question [has been found] by actually 
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pairing against each individual in the control group of drop- 
outs, another individual from the experimental group of 
graduates of the’Same sex, identical chronological age, the 
same parental nationality, the same father’s occupation, the 
same neighborhood status and identical high school marks. 
This process of more exact control reduced the total from 
290 to only 46, of which 23 were in the control group of drop- 
outs, and 23 in the experimental group of graduates. When 
the economic adjustment of these two contrasting groups is 
examined, the following results emerge: g2 percent of the 
experimental group of graduates had salary increases or no 
change in salary from 1926 to 1935, whereas only 58 percent 
of the control group of drop-outs experienced salary increases 
or no salary change for the period. This is a difference of 
34 percent in favor of the graduates. In comparison, the dif- 
ference was only 5.3 percent for subgroups of the 290 cases 
under less rigorous controls. Thus, the labor of more exact 
control was amply repaid because it demonstrated in a very 
decisive manner and in terms of a difference statistically sig- 
nificant, the superior economic adjustment of the experimen- 
tal group of graduates. In this case, the real relationship was 
more clearly demonstrated by more rigorous controls.” 31 
The above example will give some idea of the importance 
of statistical method in social experiments. Not only is this 
method essential in recording and measuring the variables in 
experiments, but it provides a technique for determining the 
relationships of factors and the degree of contingency be- 
tween them. This is all that even the comparatively exact 
sciences can do, As Pearson has pointed out: “That the 
universe is a sum of phenomena, some of which are more, 
others less closely contingent on each other is the conception 
wider than that of causality, which we may at the present 
time draw from our widening experience... No phenomena 
are causal; all phenomena are contingent, and the problem 
before us is to measure the degree of this contingency, which 
. . . lies between the zero of independence and the unity of 
causation.” * Therefore, concludes Mills, “Association and 
31 F, S. Chapin, “Design for Social Experiments,” op. cit., pp. 786, 789-94- 


32 Karl Pearson, The Grammar of Science, (Third Edition, A, & C. Black, 
London, 1911), pp. 173-74- 
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correlation are the terms which replace causation, as proba- 
bility and approximation replace the concept of certainty.” * 

By the method of sampling, averages, frequency distribu- 
tion, and selective control as described above, the statistical 
method disposes of the apparently insuperable difficulties of 
the enormous number and variability of social phenomena. 
By the method of correlation it provides a technique of meas- 
uring the degree of contingency between social phenomena. 
The importance of these techniques will be elaborated in 
later chapters. It is to the statistical method that we must 
look in the social sciences for an escape from the seemingly 
insurmountable obstacles to precise methods of experimen- 
tation in sociology. 

While social experimentation will probably always lack the 
rigor of the experiments of the physical laboratory as far as 
artificial control of the conditions is concerned, the same 
result will be achieved through refined techniques of measur- 
ing and allowing for the influence of these conditions. In 
the practical applications of the physical sciences such calcu- 
lations and allowances are always made. Such factors as alti- 
tude, temperature, etc., are variables always present, and which 
have to be measured and allowed for, before the “scientific 
laws” describing the situation are valid. The technique for 
holding constant certain factors in order to study the influ- 
ence of some other factor without artificial manipulation of 
the acttial social conditions involved has already been devel- 
oped to some degree in the statistical device of partial corre- 
lation. Social experimentation, therefore, instead of aiming 
to isolate the experiment by an actual elimination of all the 
complicating physical and social factors will instead merely 
allow for them through the methods of measurement pro- 
vided by statistics, 


Q. SUMMARY AND CONCLUSIONS 

The purpose of this and the preceding chapter has been, 
first, to emphasize the need of a scientific approach to the 
study of group behavior, and especially the need of objec- 
tivity in observation and recording. The unaided senses are 


"On Measurement in Economics,” ch. 2 of The Trend in Economics, 
Edited by R. G. Tugwell, (Knopf, 1924). 
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subject to very grave shortcomings, (a) because of their in- 
herent limitations, and (b) because of the influences of bias 
and prejudice. The observation of social phenomena is 
especially subject to these influences. Moreover, the require- 
ment of science that the essential facts of a situation shall 
appear the same to all competent observers has led to the 
development of instruments and devices for objectifying, 
standardizing, and accurately measuring observations. But 
so far, these devices and methods have been developed almost 
exclusively in the physical sciences. There the experimental 
method has been our chief reliance. 

In the observation of social phenomena there exist certain 
grave obstacles to the use of the experimental method, and 
consequently to the development of refined methods of ob- , 
servation, measurement and causal inference. These ob- 
stacles spring, first, from the opposition of society to any 
active interference with the determining conditions of a so- 
cial problem. Questions of human rights, freedom, morals, 
and other subjective, sentimental, and emotional considera- 
tions are involved. Second, there is the very difficult if not 
impossible matter of securing isolation or proper control of 
all the variables in a social experiment because (a) it usually 
has to be performed in a social medium, and (b) all the vari- 
ables may not be suspected. 

On account of these obstacles in the social sciences to the 
type of experimentation carried on in the physical'sciences, 
the most promising and productive form of social experimen- 
tation has become that provided by statistical techniques. 
Such techniques permit us to measure the influence of the 
known variables, and thus to allow for their influence in any 
given problem. This is equivalent to holding these variables 
constant, while the operation of some one factor is observed. 
Thus the essential conditions of a scientific experiment are 
achieved without the actual physical or social manipulation 
of the conditions that are significant to the experiment. 
Through the statistical methods of sampling, averaging, and 
correlation, generalization and causal inference become pos- 
sible. Through such techniques also, tools of social measure- 
ment and prediction have developed in such forms as scoring 
schedules, business barometers, business and social indices, 
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nd actuarial tables. In these devices for forecasting the so- 
Gal weather we have the counterparts of the barometer, the 
thermometer, and the other instruments of physical science. 
While these considerations clearly E e the possibility 
of scientific method in the field of social phenomena, it should * 
be frankly recognized that as yet many of the necessary tech- 
niques are highly undeveloped. Furthermore, we have not 
fully realized the need in our social organization of using to the 
fullest extent the techniques now available. Important groups 
of social facts are either not collected at all, or exist in the 
crudest form. The increasing adequacy of the United States 
Census and of the statistical records kept by other govern- 
mental agencies, such as those of the Department of Com- 
merce and the Public Health Service, represents hopeful de- 
velopments for the future. States and cities supplement these 
sources through censuses and record-keeping of their own, 
but again the incompleteness, incomparability, and general 
inadequacy of these records greatly limit their usefulness for 
scientific and experimental purposes. However, efforts are 
now being made to correct these as governmental agencies 
seek statistical advisers. The long time usually required for 
social experiments makes it of the greatest importance that 
social statistics be standardized and that their comparability 
in various geographic areas and over periods of time be main- 
tained. Finally, the lack of development in statistical method 
itself inflposes certain limitations on its use in more refined 
scientific generalizations. But up to the present our tech- 
nique of handling data already collected far surpasses in re- 
finement the data themselves and the methods of securing 
them. A comprehensive program of careful and standardized 
recording of data is the greatest immediate need for the be- 
ginning of a scientific understanding and control of society.” 
There are many able and hard-working people who are 
impatient with the painstaking methods of science and the 
seemingly futile recording of detailed data over long periods 
of time. These workers overlook the fact that often the chief 
reason for the difficulties of solving some of their problems is 
4 The cipanelon and improvement of official statistics will be reviewed in 


Chap. 11, also W, F. Ogburn, “Statistical Trends,” Jour. Amer. Statis. 
Asso. Proceedings of the Centenary Celebration 1839-1939, 35:252-60, 1040. 
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that we have not yet formulated the principles to apply to 
their problems. These, principles can be arrived at on 
through collection and manipulation of data by constantly 
more refined methods. 

In this discussion of the promise of scientific method in 
the social sciences there is no desire to deify science. The 
limitations of scientific method, and the tentative nature 
of the conclusions reached by it should be clearly recog- 
nized. In some quarters 3 the thesis i§ advanced that sci- 
ence is to the great masses merely another folklore to which 
we look for explanation and guidance. This new folklore 
has its own priesthood, namely the scientists. The belief 
of the common man in science is in all important respects of 
the same kind as his belief in other folklore. He can no 
more investigate for himself the basis of his belief in science 
than he can investigate the basis of his belief in other lore. 
He accepts both on faith, if at all. Thus, his belief in the 
rotundity of the earth is essentially similar to his belief in the 
revelations of Moses. He accepts both on authority. It may 
be objected that the theologian quotes authority as a neces- 
sity and enjoins belief as a duty, while the scientist quotes 
authority purely as a matter of convenience and can mect dis- 
belief with demonstration. That is, he who doubts the find- 
ings of the scientist may after a long period of training him- 
self become a scientist and thus may become privy to the 
sense experience on which scientific belief rests. But in the 
same way a doubter of theological doctrines may himself be- 
come a theologian and examine the basis of these doctrines, 
whether from ancient records or direct revelation. The means 
of verification of either type of lore is beyond the reach of the 
great majority of men, and is fundamentally of the same 
nature. Both rest, in the last analysis, upon the similarity of 
the responses of a number of individuals to given phenomena. 

To many people this analogy is considered as proof that. 
science is no more entitled to our respect or reliance than any 
other folklore. This view is further supported by the fact 
that the history of science even in its most highly developed 
fields is marked by a rapid discarding of old postulates and 
concepts for new. From these considerations the conclusion 

35 See C. E. Ayres, Science the False Messiah, (Bobbs-Merrill, 1927). 


78 SOCIAL RESEARCH 


A 

is drawn that, since nothing is final or absolute, and since all 
theories must be taken on faith, one folklore is as good as 
another. But this is clearly an erroneous conclusion. ‘There 
are three principal tests of the validity of a method or of a 
conclusion. The first is the agreement of competent inves- 
tigators; the second is the agreement and congruity of the 
observations and conclusions with each other; the third test 
of validity lies in the utility of a given method in solving our 
problems. In all three respects, the methods and conclusions 
of science have proven, not their infallibility, but their supe- 
riority. It is by these criteria that the relative value of folk- 
lore must be determined. It is by these criteria that science 
must make its appeal. And it is by these criteria that the 
true scientist always regards his conclusions as tentative, and 
to be discarded whenever a new conclusion or a new method 
promises to meet the test of utility more adequately. 

Whatever view one may take regarding the possibility of 
the development of exact social sciences as set forth in these 
chapters, all will agree to the immediate and practical value 
of the collection of certain social data for guidance in the 
everyday problems of administration of current affairs. The 
principle of ordinary bookkeeping, at least, will need no de- 
fense. To the technique of collecting such data as is gener- 
ally conceded to be essential, we now turn our attention. 

As for the more ultimate aim of exact social sciences, it may 
be suggested that in the current skepticism on the subject of 
the actual usefulness of scientific method in the solution of 
concrete social problems, science is at least entitled to the 
benefit of the doubt. As Chase has said: “So far as we can 
see, the future . . . lies with the man of science — the social 
scientist, the engineer. For upwards of one hundred thou- 
sand years, the mystics, the medicine men, the orators, the 
spellbinders . . . the personally crafty, the dealers in . . - 
rationalized sophistry, have imposed upon others their shoddy 
inferences, conclusions and even controls. It is difficult to 
see how the man of science can do worse.” #* 

% Stuart Chase, The Tragedy of Waste, (Macmillan, 1930), p. 279. 
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R. SUGGESTIONS FOR FURTHER STUDY 


1. For a more general discussion of the problems of objec- 
tive observation of social phenomena, see G. A. Lundberg, 
Foundations of Sociology, pp. 17-22, (“The Visibility and 
Objectivity of Social Phenomena”). See also, A. F. Bentley, 
Behavior Knowledge Fact, (Principia Press, Inc., Blooming- 
ton, Ind., 1935), Chap. 21 (“The Visibility of the Social”). 

2. For a comprehensive digest of the large literature of 
concrete studies in social psychology with special reference to 
experimental studies, see G. & L. Murphy and T. Newcomb, 
Experimental Social Psychology (Rev. Ed., Harper, 1937): 

3. Excellent brief collateral reading for the sections on ex- 
perimental methods will be found in the three experiments 
(one of which is summarized in the text) described in F. $. 
Chapin’s “Design for Social Experiments,” Amer. Sociol. Rev., 
3:786-800, 1938. Further description of this procedure 
may be found in the following subsequent articles by the 
same author: “A Study of Social Adjustment Using the 
Technique of Analysis by Selective Control,” Social Forces, 
18:476-87, 1940; “An Experiment on the Social Effects of 
Good Housing,” Amer. Sociol. Rev., 5:868-79, 1940. 

4. For a profound consideration of the logic of scientific 
inquiry, see John Dewey, Logic: The Theory of Inquiry, 
(Holt, 1938), Part IV. 

5. Herbert Spencer’s The Study of Sociology, (Appleton, 
1903), remains a classic on the subject of the present chapter. 
On the different types of bias likely to be present in sociologi- 
cal study, see especially ch. 8-12. 

6. F. S. Chapin, “Some Problems in Field Interviews 
When Using the Control Group Technique in Studies of 
the Community”; Amer. Sociol. Rev., 8:63-68, Feb. 1943. 

7. E. Greenwood, Experimental Sociology (N. Y., Kings 
Crown Press, 1945) 


CHAPTER III 


TERMINOLOGY, UNITS, AND CLASSIFICA TION 


The rough working definitions of the dictionary serve to throw some light 
on the meaning of isolated words, but it is when the words are actually used 
in assertions that their inherent indefiniteness leads to doubt, and to this is 
due nine-tenths of the wrangling that goes on in all the Council chambers of 
the country. —F. W. Westaway. 


The preceding chapters have emphasized that one of the 
principal requirements of a scientific fact is that it shall ap- 
pear the same to all competent observers. This requirement 
makes it necessary (1) that the conditions of observation be 
standardized, and (g) that a standardized set of symbols and 
terminologies be employed to describe our sense experiences. 
The first of these conditions was treated in the chapter im- 
mediately preceding. The present chapter considers the sec- 
ond requirement. 


A. THE IMPORTANCE OF DEFINITE TERMINOLOGY IN SCIENCE 
ka 


» The futility of much argument and discussion resulting 
from a poor definition of the terms employed could be illus- 


trated at great length. A typical example from Professor 
James will be sufficient here. 


Some years ago, being with a camping party in the mountains, 
I returned from a solitary ramble to fad everyone engaged in a 
ferocious metaphysical dispute. The corpus of the dispute was a 
squirrel —a live squirrel supposed to be clinging to one side of 
a tree-trunk ; while over against the tree's opposite side a human 
being was imagined to stand. This human witness tries to gê- 
sight of the squirrel by moving rapidly round the tree, but no 
matter how fast he goes, the squirrel moves as fast in the opposite 
direction, and always keeps the tree between himself and the 
man, so that never a glimpse of him is caught. The resultant 
metaphysical problem now is this: Does the man go round the 
squirrel or not ?? 


Obviously, as Professor James soon pointed out, this whole 


dispute hinged on the meaning of the term go round. If we 


1 Scientific Method, (Blackie, London, 1912), p. 8. 
2 William James, Pragmatism, (Longmans, 1997), P- 43- 
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mean by going round to occupy successively positions, say, 
east, south, west, and north, of the object in question, then 
certainly the man goes round the squirrel. If, however, we 
mean by going round to occupy successively positions in front 
of the squirrel, to his right, behind him, and to his left and 
finally in front of him again, then it is equally clear that the 
man does not go round the squirrel, for the squirrel in the 
problem is constantly changing his position so as to keep it 
constantly relative to the man. Now our point is that the 
great majority of disputes have at their basis some such mis- 
understanding or conflicting use of terms. 

The importance of language? as an instrument of adjust- 
ment among human beings is usually emphasized in all gen- 
eral treatises on sociology and psychology. It is pointed out, 
for example, that a map is a highly valuable symbolic repre- 
sentation, provided its pattern, order, and sequences corre- 
spond to the actual terrain over which we must travel. It is 
a corresponding handicap, if it fails to indicate where the 
rivers and the mountains are, if it confuses the order in which 
they occur, or otherwise fails to correspond to the conditions 
to which we must adjust. Sometimes a local map which may 
be adequate for most kinds of travel within its own borders 
turns out to be very misleading in the light of a larger per- 
spective. In the same way the verbal systems and orientations 
of a primitive, primary-group society may turn out to be 
grossly inadequate in a national or world society of secondary 
group relationships. 

We are not here primarily concerned with the vast amount 
of misunderstandings among people in daily conversation on 
account of the inadequacy of their language as a means of 
communication. We are interested mainly in the same prob- 
lem as it affects the social sciences. As an illustration of the 
type of impasse into which an inadequate symbolic system is 
likely to force us, we may refer to Zeno’s well-known paradox 

8 For a good popular treatment of the subject, see Stuart Chase, The Tyr- 
anny of Words, (Harcourt, 1938). An excellent brief treatment is Leonard 
Bloomfield, Linguistic Aspects of Science, Encyclopedia of Unified Science, 
Vol. 1, No. 4, (Univ. of Chicago Press, 1939). See also, E. A. Esper, “Lan- 


guage,” Chap. 11 in Handbook of Social Psychology, Edited by C. Murchison, 
(Clark Univ. Press, 1935), p- 456. 
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of Achilles and the tortoise. If Achilles allowed the tortoise 
a head start in a race, it was suggested, the former could never 
overtake the latter. Whatever common sense and an actual 
trial might show, the language and the logic in terms of 
which the paradox was set up left no escape from the strange 
conclusion, The tortoise is allowed one hundred yards start. 
Although Achilles runs ten times as fast as the tortoise, the 
latter will obviously have gone an additional ten yards by the 
time Achilles has run the first hundred yards. When Achilles 
has run this additional ten yards, the tortoise will still be one 
` yard ahead. By this process of yerbalization about the race 
it is obvious that, while Achilles is constantly getting nearer 
to the tortoise, he can never quite overtake it. 

If instead of the above method of symbolizing this event 
we resort to a simple algebraic or geometric statement‘ of a 
type unknown to the Greeks, we come to a very different 
conclusion. We find by the latter methods (under given as- 
sumptions) that Achilles definitely overtakes the tortoise at 
the end of 111.11 seconds. What is more important, this 
result also corresponds to what happens in an actual experi- 
ment. This verifiable result of the latter method suggests 
that perhaps it is preferable as a symbolic device for such 
mundane matters as, for example, planning the dispatch of 
trains. The former method may be more advantageous in 

creating” thrilling situations in imaginative literature, just 
as the trips of rocket-ships to the moon in current cartoons 
and movies have unquestioned recreational value. They may 
be regarded as quite harmless unless someone actually attempts 
to take off, with present equipment, on such a journey. We 
are interested here in symbolic systems which will be reliable 
_ #See the geometric demonstration in L, Hogben, Mathematics for the Mil- 
lion, (Norton, 3937), p. 89. In practical adjustments to our immediate en- 
vironment we set aside symbolic Tacos which do not correspond 
to the observed facts, But in our adjustments to secondary group situations 
with which we have never had direct personal experience, we rely entirely 
upon symbols, and if they do not faithfully represent the concrete adjust- 
ments to be made, the results are disastrous in the same way that it would be 
disastrous in certain circumstances to rely on the reasoning involved in the 
story of Achilles and the tortoise. Many of the pronouncements of social 
philosophers are dangerous in the same way. The symbols, the reasoning, 
and the conclusions seem irrefragable on paper, but th simply do not cor- 
respond with what occurs in actual social relations. Many proverbs are of 


this character. They represent not statements of facts, as is commonly as- 
sumed, but somebody's wishful thinking as to what ought to occur. 
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charts to adjustments beyond the universe of the symbols” 
themselves. 

The symbols man uses at first to represent the world are 
oral, and very little scientific development is possible so long 
as this is true. Many of the things that scientists wish to com- 
municate simply cannot be adequately transmitted through 
oral language. The structure of the idea which scientists 
must communicate is so complex that it cannot be matched in 
a succession of acoustic stimuli. Consequently, the language 
of science must increasingly consist of written graphic symbols, 
which provide an enduring instead of an immediately van- 
ishing stimulus, and which offer possibilities of arrangement 
(tabulation, etc.) that cannot be communicated in oral lan- 
guage. In devising written symbols, the tendency in science 
is to develop, as soon as possible, special symbols representing 
highly abstract, standardized, and ordered responses called 
quantitative terms instead of the emotionally charged words 
employed in everyday common-sense communication. The 
indefiniteness of meaning of such symbols as “many,” “much,” 
“more,” “few,” “less,” “least,” as well as avowedly “qualita- 
tive” terms such as “good,” “bad,” “symmetrical,” “level,” 
“heavy,” “light,” etc., led to the invention of number systems 
and the symbolic division of various subject-matters into units 
and scales. These are highly standardized and readily verifi- 
able response categories. 

So much has science been dependent upon: this develop- 
ment that we find the advancement of science largely con- 
tingent upon the development of mathematics. It is on the 
symbolic uniformity of mathematics that we rely for further 
power to describe the uniformities in the behavior of phe- 
nomena. The description of these uniformities constitutes 
the essence of science. 


B. CONFUSION OF TERMINOLOGY IN THE SOCIAL SCIENCES 


In the description of social phenomena very little use has 
yet been made of the language of science, namely, mathe- 
matics. Asin the early stages of every science, we still employ 
in the social studies the vague and indefinite terminology 
of common parlance drawn from the folklore (common- 
sense experience) of the past. That terminology, consisting 
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as it largely does of the concepts of primitive mysticism, is 
not well adapted to the uses of science. But the meaning of 
words, being the growth of centuries, is not easily abandoned 
or changed. Even in the comparatively well developed physi- 
cal sciences, there is still a large vocabulary of common-speech 
designations which are retained as traditional units. Thus 
we have in physics such terms as horse-power, foot, candle- 
power, etc. The difficulties inherent in such terms have been 
obviated by assigning to these traditional words mathematical 
definitions that are entirely arbitrary. A horse-power, for ex- 
ample, originally meant the amount of energy which a horse 
is capable of exerting. Clearly, this amount would vary 
within wide limits according to the size and physical condi- 
tions of the horse. But the scientific definition of the term 
as the amount of power required to lift 33,000 pounds one 
foot high in one minute at sea level permits a very much 
more accurate designation of the idea. One of the principal 
reasons for the comparative indefiniteness of the social sci- 
ences is that in these studies we still employ the terminology 
drawn from common-sense experience without having as- 
signed to these terms a meaning approaching in definiteness 
the physicist’s definition of horse-power. 

The results of this lack of a standardized terminology in 
the social sciences are apparent in nearly all controversies 
and discussions in these fields. Such common terms as group, 
Society, community and public are frequently used inter- 
changeably and with different meanings by different writers. 
The result is much fruitless argument over the meaning of 
such terms as public opinion, social mind, community, con- 
sciousness, etc., and a great vagueness as to which of several 
common meanings of a word should apply in a given case. 
In the social scienges, most of the important terms have sev- 
eral meanings. Usually the meaning is dual —one being the 
meaning of the term as employed in common parlance and 
the other a broader or a more limited meaning which, by 
common consent, has been adopted by the students in a par- 
ticular field. Thus culture is used in common speech to in- 
dicate the so-called refinements and etiquette of polite society, 
usually including some familiarity with belles lettres and 
other fine arts. In anthropology and sociology, culture usu- 
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ally includes the entire social accumulation of behavior, cus- 
toms and institutions, but there is much vagueness and dif- 
ference of opinion even in these professions as to the exact 
meaning of the term. A college student once questioned the 
author’s use of the term behavior to include both good and 
bad behavior. This student had always thought of the term 
as applying only to socially approved conduct. Likewise, the 
-confusion in the meaning of the word primitive has caused 
the proposal of the term pre-literate for societies which have 
not yet developed a written literature. Again, in economics 
there is a wide disparity between the popular and the econo- 
mist’s definition of production, consumption, etc. 

In 190% Small listed forty-eight terms and concepts which 
at that time, in his opinion, constituted the terminology of 
sociology.® In 1931 Eubank, after an extensive survey of the 
literature of the field, compiled a list of 332 such terms and 
concepts. Commenting on his list, Professor Eubank says : 


The very extent of such a list as this is confusing; but it is 
indicative of an underlying confusion that is still more significant. 
It reveals strikingly how far the sociologists are from agreement 
upon the very terminology itself. The youth of this science is 
nowhere more clearly shown than in the indefiniteness and lack 
of uniformity of its vocabulary. Different writers use the same 
term in different senses. Conversely, the same idea is labeled 
differently by different writers.” 


As an illustration of this confusion, Eubank has tabulated 
the lists of major concepts employed by each of ten leading 
American sociologists, as found in eight general texts. He 
finds a total of 146 items. Only 63 out of the 146 are found 
in more than one list ; not one appears in all eight lists ;,only 
one term appears in as many as seven lists; only two others 
are found in six lists; and only eight other terms appear in 
five of the lists. Professor Eubank summarizes the matter as 
follows: 


Expressed in a sentence, this means that out of eight volumes 
of sociological theory, selected because their authors are men 
whose recognized standing in American Sociology entitles them 


5 Albion W. Small, General Sociology, (University of Chicago Press, 1905), 


br 29. 
6È. E. Eubank, The Concepts of Sociology, (Heath, 1932), Chap. 4. 
T Ibid., p. 43. 
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to be regarded in an especial way as oy sapere of the field, 
not a single concept occurs upon which they are agreed as a main 
heading for the materials which they have singled out as most 
important to discuss ; and over 55 per cent of the concepts use 
as main headings (a total of 83 out of 146) are found in not more 
than one of the entire eight tables of contents.§ 


The incomparability, and hence the diminished scientific 
utility, of much current social data, because of the lack of ob- 
jective definition of units, is strikingly illustrated in the field 
of social work. McMillen reports the results of a study in 
this field which revealed “a complete absence of uniformity 
among agencies in their classification of major care and minor 
care cases... Moreover, the data afforded no evidence of 
uniform policy by which minor care cases could be differen- 
tiated from the miscellaneous services not counted as cases. 
The responses showed unmistakably that a contact which 
might be counted as a minor care case in one agency would 
in another be excluded from the service count.”® A further 
similar check on 25 agencies reputed to have high standards 
of work “revealed a lack of uniformity of practice very little 
less marked than that found in the entire group.” The move- 
ment during the last decade for standardized reporting of 
social statistics (see pp. 402-406) has undoubtedly greatly im- 
proved this situation. It is obvious that intelligent discus- 
sion, not to mention statistical studies, depend on a more 
rigorous definition of our terms. 


C. THE CRITERIA OF SCIENTIFIC TERMINOLOGY 


It is clear that no field of inquiry can be reduced to a very 
definite sciénce as long as its terminology and concepts are 
thus badly defined. The need for terminology understood 
and used in the same way by all the students in the field is, 
therefore, a great present need in all the social sciences, and 
perhaps especially in sociology. This gives rise to the ques- 
tion how such a terminology is to be developed. The ulti- 


$ Ibid., p. 46. 
A, W. McMillen, Measttrement in Social Work, (Univ. of Chicago Press- 


1939), p. 48. See also A. W, McMillen and Helen R. Jeter, “Statistical Tet 
minology in the Family Welfare Field,” Social Service Review, 2:357-84- 


TERMINOLOGY, UNITS, CLASSIFICATION 87 


mate test of the value of a particular term or concept must 
be its usefulness for the purpose at hand, namely, the descrip- 
tion of behavior so objectively as to be subject to corrobora- 
tion by other persons, usually by reference to standards exter- 
nal to the individual organism. Terms and categories should 
be regarded merely as working tools, and not as entities in 
which inhere magical properties. They should, therefore, 
be modified or abandoned as they serve, or fail to serve, our 
purpose. But at the same time, the requirements of science 
make necessary a certain uniformity. This uniformity can be 
secured only by general agreement between the workers in a 
field to use certain symbols to designate certain facts or rela- 
tionships. Thus the American Statistical Association once 
had a Committee on Terminology to secure uniformity in 
the use of Greek letters and other terms as statistical symbols. 
In 1938 a Committee on Conceptual Integration was formed 
in the American Sociological Society to consider ways and 
means of remedying the confusion reviewed above. Similar 
action is badly needed in other social sciences. 

The criteria by which a term is to be accepted or rejected 
must again be determined ultimately by the utility of the re- 
sults. Professor Eubank suggests four tests of a true scien- 
tific concept : 1° 

(1) Is the term reasonably precise? Does it convey an ex- 
act and clear-cut meaning? 

(2) Does the term convey only one final idea? 

(3) Is the term perfectly general ; that is, always employed 
in the same sense wherever it is used ? 

(4) Is the idea fundamental to its particular field ; that is, 
essential for its inclusive interpretation ? 

While several of these criteria are themselves somewhat 
subjective, they may serve as tentative guides to the formula- 
tion of a useful terminology. Uniformity in the use of terms 
thus arrived at can be secured only by common agreement. 
This is important. On the other hand, if the specific mean- 
ing of a term is objectively defined, and if it denotes a sig- 
nificant behavior-segment or operation, it matters little what 
symbol we agree upon. E 


10 Op. cit., p. 398. 
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D. THE NATURE OF OPERATIONAL DEFINITIONS IN SCIENCE 


Perhaps the most elementary way of defining a word is to 
point to an example of it or to go through a motion and 
simultaneously say the word used to designate that motion 
(or set of motions). That is, we teach a child the meaning 
of the word “book” by pointing to one or several books in 
succession; we leap into the air and say, “jump”; and so 
forth. The ease and clearness with which we can communi- 
cate the meaning of a word in a given case depends here upon 
the convenience with which such motions can be performed. 
The clearness with which we can communicate the general 
or class meaning of these words depends on the facility with 
which the motion in a given case will be recognized as similar 
to these objects or motions in the cases of other books and 
other jumps. * 

The ease with which different words can-be defined in this 
way, and consequently the uniformity of meaning which dif- 
ferent users of a word attach to it, varies greatly. The diffi- 
culty increases as we desire to communicate the meaning of 
abstract words, designating complicated relationships between 
eventsand things. Such words as electron, acceleration, honor, 
status, cooperation, etc., are of the latter type. The problem 
of definition of scientific concepts is largely of this nature. 

We have already pointed out what physicists have done 
with such terms as horse-power when they have taken over 
such a folk word as a designation for some aspects of the phe- 
nomena with which they work. As we have seen, physicists 
invent instruments which behave in certain ways — which 
“register” — when applied in certain ways. Clocks, thermom- 
eters, ammeters, etc., are of this character. Scientists are 
today content to define such concepts as time, temperature, 
electricity, as that which is registered or measured by these 
instruments. Questions about the “ultimate nature” of these 
phenomena are regarded as obsolete and unscientific. Scien- 
tists agree to designate each degree or kind of behavior which 
their instruments indicate by specific words or other symbols. 
These words so defined may then be used to build up more 
complicated words, the definition of which is, however, al- 
ways reducible to the readings of standardized instruments. 
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Thus, velocity is a rate of motion, acceleration is a rate of 
change of velocity. Even such a word as “electron,” which 
is.popularly thought of, perhaps, as a small billiard ball, is 
now defined by the physicist by means of a differential equa- 
tion. Such definitions are called operational definitions. 

There is nothing mysterious or magical about the word 
“operational.” It is merely a word we use to designate a type 
of communication which “gets across” with high reliability. 
Thus, a recipe for a chocolate cake may be regarded as an 
operational definition of such a cake. Now a recipe is, of 
course, only a group of words, and, as such, is no different 
from any other group of words constituting a definition. 
Strictly speaking, we might contend that it is the actual per- 
formance of baking a cake which constitutes its definition. 
To the extent that the words of the recipe designate accu- 
rately all the steps in the performance, however, we call the 
recipe an operational definition of the cake in question. The 
possibility of writing such a recipe obviously depends upon 
the prior existence of well-recognized meanings of such words 
as “sugar,” “chocolate,” “a cupful’” (standard measuring cup), 
“hot oven” (so many degrees, etc.), as well as of all the words 
designating operations like “mix,” “pour,” etc. The test of 
all this would lie in the similarity of the performance and the 
product when numbers of different people are provided with 
the recipe and follow it exactly. 

Operational definitions, then, are merely definitions which 
consist as far as possible of words clearly designating perform- 
able and observable operations subject to corroboration. 
Thus, they may consist of (1) physical manipulations, such as 
baking a cake or reading a thermometer ; (2) objective verbal 
designations of these manipulations; or (3) verbal designa- 
tions of symbolical or mental operations, such as the defini- 
tion of cube root or other mathematical operations. 

Now the degree to which the above criteria can be satisfied 
varies considerably according to the stage of development 
which a science has reached. The prevalence of operational 
definitions, for example, in physics is undoubtedly due to the 
ready availability and standardized characteristics of the in- 
struments for measuring time, space, temperature, voltage, 
etc. It is not contended, therefore, that other definitions 
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which are only imperfectly, slightly, or not at all operational 
may not be valuable in the early stages of a science, and on 
the frontiers of well-established sciences. They may be use- 
ful as first approximations, pending more mature develop- 
ments. Highly perfected operational definitions are goals 
toward which to strive, rather than tools to be hoped for or to 
be conjured up ready-made at the outset of an inquiry. This 
has always been recognized by the proponents of operational 
definitions. 

One other argument against the feasibility of operational 
definitions in sociology should be briefly considered. This 
is the familiar contention that it is not only the maturity of 
a science, but the nature of the subject-matter which deter- 
mines whether operational definitions are practicable or de- 
sirable. In this connection, we encounter again several cru- 
cial misconceptions which we remarked upon in the first 
chapter. It is frequently assumed that some phenomena, es- 
pecially social phenomena, are intrinsically subjective, others 
intrinsically objective, and that consequently some phe- 
nomena are amenable to measurement and operational treat- 
ment, others not. ‘As we have already pointed out in the first 
chapter, these categories (objectivity, measurability, etc.) are 
in science not regarded as the inherent characteristics of any- 
thing. They are merely designations of different ways in 
which man responds to phenomena and communicates his 
responses. When an observer communicates his observations 
so that others can corroborate his reports, we call such data 
objective. As for measurability, this is again merely a par- 
ticular way of responding to phenomena. A large vocabulary 
of such words as more, less, high, low, heavy, light, etc., desig- 
nate informal and non-numerical measurements.!2 The more 

11 Cf. John Dewey; “. . . I submit that if the words personal and imper- 
sonal were uniformly prefixed to things (‘objects’ in the sense in which they 
are linguistically identified with things) instead of the words ‘subjective’ and 
‘objective,’ an artificial, because gratuitously instituted, problem would be 


eliminated.” The Philosophy of john Dewey, (P. A. Schi pp. Editor, North- 
western Univ., 1939). P: 543- 

12 Cf. John Dewey, Logic., The Theory of Inquiry, (Holt, 1938), p. 202 : 
“All comparison is of the mature of measurement. .. The only dificult 
standing in the way of redbgnition of the equipollence of comparison an 
measurement is the fact that the results of many measurements are stat 
qualitatively, not in numerical terms.” See also my Foundations of Sociology, 
PP- 187-42. 
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formal measurements are, as we shall see, symbolic inventions 
to designate gradations in different aspects of phenomena, and 
they are as subject to development with respect to social phe- 
nomena as with respect to any other data. As we also saw in 
Chapter I, this means that the imaginings, thoughts, feelings, 
and phenomena of “consciousness” generally, when observed 
and reported in as checkable form as any other laboratory 
observations, either by the observer upon himself or upon 
others, are as amenable to scientific study as any other body 
of data. 


E. OPERATIONAL DEFINITIONS IN THE SOCIAL SCIENCES 


We may now summarize this discussion of definitions with 
two concrete illustrations of the kind of work that is now be- 
ing done to develop more specific, objective, and operational 
definitions of sociological terms. Consider such a word as 
“morale.” What does it mean? One source gives the fol- 
lowing definition: “... the degree to which the individual 
feels competent to cope with the future and achieve his de- 
sired goals. [Negatively it is characterized by] . . . feelings 
of insecurity and discouragement.” 1° 

Note that this definition consists of words about a word, 
and that the entity defined is said to be an individual's feel- 
ing about his situation. In other words, “morale” would 
appear to be one of these “subjective,” “inner,” “mental” phe- 
nomena frequently supposed to be beyond the reach of scien- 
tific methods. The attempt to use the above definition as it 
stands in an actual investigation of whether some groups have 
more morale than others, would doubtless result first of all in 
verbal quarrels over the definition. Some would suggest an- 
other wording, because as it stands, the definition “does not 
cover,” or “leaves out” some things that “really are” morale. 
Since the number of different associations which different 
individuals may have with a word are very numerous, pro- 
longed discussion, numerous definitions, and no conclusion 
as to the “right” one would be the result. Clearly, what is 
needed, therefore, is the designation of some behavior (ver- 


13E. A. Rundquist and R. F, Sletto, Scoring Instructions for the Survey of 
Opinions, (Univ. of Minnesota Press, 1936). My discussion of this illustration 
is adapted from F. S. Chapin’s excellent article, “Definition of Definitions of 
Concepts,” Soc. Forces, 18:153-60, 1939- 
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bal, emotional, or any other) which as individual items or as 
a configuration, we agree to call “morale.” These items must 
then be arranged in a scale or test so that people who are 
characterized by a high degree of the behavior we have agreed 
to call morale will, when measured by the test, receive a high 
score, and vice versa. 

The methods of constructing such tests will be discussed in 
a later chapter. Suffice it here to say that the process de- 
scribed above was actually carried out by Rundquist and 
Sletto, and that with the resulting instrument they could de- 
termine objectively (i.e., verifiably, in terms of the specific 
behavior taken into consideration by the scale) the difference 
in morale between, for example, employed and unemployed 
groups. Chapin™ has constructed on similar principles an 
instrument for defining and measuring the social status of 
American urban families. The methods of constructing and 
validating this scale will also be discussed in a later chapter 
(Chapter IX). Our concern here is merely with the principle 
involved, as it bears upon the technique of defining socio- 
logical terms objectively. 

It is sometimes alleged that this procedure results in a 
“degradation” of concepts; that the terms as operationally 
defined (a) no longer mean what they meant before, and that 
(b) an operational definition neglects some aspects of be- 
havior which many people have hitherto included in it (¢.g+» 
“morale”). Both these objections may be admitted. But 
they do not undermine the value of the procedure. It is un- 
doubtedly true that words like “morale,” “socio-economic 
status,” “intelligence,” etc., defined in terms of scores on à 
specific scale will have a very much narrower (or at least a 
more or less different) although a perfectly definite, meaning 
than these words have today in general folk usage. But this 
does not mean that all the folk meanings which these words 
now have would be denied, or ignored. We may admit that 
some or all of the phenomena to which these words refer in 
folk language and literature are possibly quite as important 
as those included in the objective and specific definition pro- 
vided by the scale.’ These other meanings, in so far as they 


° 


on S. Chapin, Contemporary American Institutions, (Harper, 1935) 
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are relevant to the problem we have set ourselves, must be 
similarly defined operationally by other words, symbols, and 
scale scores. If the objective and operational definitions of 
words leave out “shades” of meaning which were formerly 
attached to the words defined, the solution is to develop 
equally clear definitions for these neglected aspects, either as 
a gradation of the main definition or as an independent word 
denoting a definite aspect. 

This will mean that common words when adopted for sci- 
entific purposes (e.g., horsepower, foot, water = H,O, etc.) 
will undoubtedly have meanings quite different from their 
folk connotations. But this refinement — “degradation” some 
would call it—is the very essence of science. To argue that 
“the more precise and unambiguous the terms become, the 
less valuable they are” on account of the numerous folk con- 
notations which scientific terms leave out, overlooks the fact 
that all the neglected aspects are subject to equally objective 
definition. The sensing and objectifying of increasingly sub- 
tle aspects of phenomena is the endless task of science.® 


F. THE UNITS OF SCIENCE AS CONSTRUCTS OF CONVENIENCE 


The same difficulties and confusion which we have ob- 
served above with respect to terminology generally, apply to 
the more specific problem of units of measurement in the 
social sciences. For, although society is a whole, with all its 
aspects and parts inextricably interwoven, the limitations of 
our senses require that we study it in parts. The same is true 
of every field of science. This gives rise to the difficult ques- 
tion, “Into what units or parts is it permissible to break up 
the whole?” Here again, our final criterion must be the use- 
fulness of the units for the purposes at hand. And here again 
we encounter many conflicts with conventional, traditional, 
and metaphysical units and classifications. 

One of the commonest controversies in the social sciences 
over the matter of units is the controversy over the “reality” 
of “society,” or the “group,” as a unit. Because of the time- 
honored focus of attention on the individual as thè measure 

15 For elaboration of this subject, see my Foundations of Sociology, pp. 58- 


77. See also Chap. TX of the present text, pp. go2-306. See also “Is Ambi- 


guity in Some Social Science Concepts Desirable?” Amer. Jour. Sociol. 
(forthcoming 1942). 
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of all things, the assumption by some sociologists of the group 
as an equally “real” entity, has been vigorously attacked both 
within the field of sociology and outside. This controversy 
again betrays the same failure noted above with respect to 
terminology generally, namely, the failure to recognize that 
terminology and units are merely convenient delimitations of 
phenomena for certain purposes of study, and that as such, a 
term or a unit must justify itself purely by its utility. Thus 
the individual organism is a bundle of parts, a functional 
arrangement, which we have found it convenient for many 
purposes to consider as a unit. For certain other purposes, 
however, it is wise to break up this unit into smaller parts, 
represented by the numerous units and classifications of physi- 
ology, psychology, chemistry, and physics. Usually these sub- 
divisions, when first proposed, have been vigorously opposed, 
and their “reality” denied. It is not surprising, therefore, 
that the attempt to synthesize individuals into a new unit 
called a “society,” and to claim for it the same reality as for 
its component units meets with vigorous opposition. The 
reason is the same in both cases, namely, that the new units 
call for a readjustment in our mental focus—our habits of 
thought — which is painful. It is therefore alleged that the 
new ,unit is “unreal,” “artificial,” and otherwise objection- 
able. Professor Wallis has admirably disposed of this par- 
ticular controversy in the following excellent passages : 


A group is an entity when it shows persistence of qualities and 
when the parta of which it is composed are mutually interde- 
pendent. In such a case it functions as a unity, and so is a 
reality... The reason why the concept of a human individual 
1s so easy, is that we start with it, not acknowledging the syn- 
ome n the realm of individual psychology we do the same 

ng. 

. + +I do not object to this procedure, but merely point out 
that it is a procedure, and that the procedure itself usually escapes 
our knowledge just because it is an overly familiar procedure. 
Far from objecting to it, I would be ready to admit that a solar 
system is as much a reality as is a grain of sand, and that it is as 
true to say that a solar system gives us grains of sand, as to say 
that grains of sand give us a solar system. In other words, the 
things discovered by analysis no higher reality than those 
discovered by synthesis. pte a and AR ah are merely differ- 
ent methods of approaching the study of things, for becoming 
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acquainted with the nature of the world in which we live, or 
which lives in us. 

Our justification for these conclusions will be our ability to 
predict social Nasa ery by a study of social phenomena, and 
the corresponding inability to predict them by a study of in- 
dividual psychology.?¢ 

Another aspect of the same failure to recognize the purely 
“artificial” or utilitarian nature of all terminology and units 
is the controversy over the relationship of parts to the whole. 

The question of the relationship between the individual 
and the group is still very much in the foreground of socio- 
logical discussion. In psychology the reverse of the same 
question is a matter of controversy between the Gestalt school 
and other factions, the “Gestaltists” being dedicated to the 
proposition that a study of individual responses in isolated 
parts does violence to “reality,” inasmuch as it is the organ- 
ism responding to the situation as a whole which behaves. 
The proponents of the so-called case method in sociology and 
the statisticians are haggling over the same difficulty when the 
case workers declaim against the statisticians’ practice of lump- 
ing together large numbers and extracting an average. This 
average, it is solemnly affirmed, lacks “reality.” There is 
little point or purpose to any of these discussions. Nor is 
there any possibility of settling the controversy as long as the 
terms and concepts employed are thought of as absolute en- 
tities —real in their own right—instead of convenient cate- 
gories into which, for certain purposes, we have seen fit to 
divide the phenomena of the universe. The alleged greater 
reality of one category as compared with the other, is purely 
the result of convention and familiarity. The only justifi- 
cation for any unit or classification is its utility for our 
purpose. 

That units of any order are merely convenient constructs, 
and not metaphysical entities with a mysterious reality of 
their own, is a fact which cannot be too strongly emphasized. 
For the social sciences are still largely under the influence of 
philosophical word-puzzles, which causes them to waste much 
time and energy on entirely futile discussions. Thus, in addi- 
tion to the metaphysical attitude toward units noted above, 


16 Wilson D. Wallis, “The Independence of Social Psychology,” Jour. Ab- 
normal and Soc. Psych., 20:147-50, 1925: 
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the whole-part relationship is receiving serious discussion of 
a type to edify any philosopher. May one analyze either the 
whole or its parts without destroying the reality of either? 
As we have seen, our knowledge of the world and its parts 
is conceptual, and analysis as such consists of the substitution 
of one set of symbols for another because it is easier in some 
sense and for some purposes to reason with (manipulate) 
one set of symbols rather than another. Thus, mathematics 
affords no end of illustration of the fact that, although two 
expressions may be “equal” or “identical,” they lend them- 
selves to widely different types of manipulation. The ques- 
tion whether a particular set of parts in a given relationship 
is to be identified with the whole depends entirely on the 
purpose for which the identity is to be asserted. The validity 
of each method of symbolization is a question of the function 
it performs. For some purposes, a mere algebraic sum of the 
parts is equivalent to the whole. For other purposes, it is 
quite inadequate, as further specification of the relationships 
of the parts may be required. To use Copeland's illustra- 
tion, “as a sum of so many sticks, one cord, a neat wood pile 
in the back yard is identical with those same sticks scattered 
hit or miss over the yard. But as a whole of the type which 
we designate as a neat pile, there is obviously an important 
distinction.” #7 Logical distinction must not be considered 
the same as physical dissection. “Conceptual analysis permits 
the parts to be conceived as they are when parts of the living 
whole, and not as they would be if physically isolated. . . An 
analysis is a veritable construction of the person who makes 
it. The parts are not logically in the whole. In constructing 
an analysis, a person builds up a new —and perhaps better — 
way of responding to a thing—a new way of dealing with it 
‘mentally’... The merits or demerits of a physical or any 
other type of analysis are for a scientist questions of the suc- 
cess they yield in predicting or controlling.” 


G. REQUIREMENTS OF DESIRABLE UNITS 


With this view of the nature of units before us, we may 
summarize briefly some of the concrete principles of deter- 


1 Morris A. Copeland, “An Instrumental View of the Part-Whole Relation.” 
Jour. Phil, 24:q6-104. 1987. 
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mining units for specific purposes. While it has been empha- 
sized above that one of the principal requirements of a term 
or a unit is that it should mean the same to all minds, this 
does not mean that a generally accepted definition is the only 
requirement of a satisfactory unit. Four principal require- 
ments may be summarized as follows: (1) Appropriateness, 
(2) Clarity, (3) Measurability, (4) Comparability. Let us con- 
sider briefly the meaning of these requirements. 

1. Appropriateness. The appropriateness of a unit must 
be determined entirely by its usefulness for the purpose at 
hand. It follows that a unit which is appropriate for one pur- 
pose may be entirely inappropriate for another. Thus, if the 
purpose is to determine the increase or decrease in the 
amount of cultivated farm land, account of the number of 
“farms” would not necessarily reveal the answer sought. If 
it is desired merely to find out the increase or decrease in the 
number of farms, however, the unit “farm” is satisfactory. 
Likewise, if the purpose is to determine the cost of living in 
a particular place, a determination of the wholesale prices of 
products consumed might lead us widely astray. The appro- 
priate unit here is obviously retail prices. Again, if the pur- 
pose of a study is to compare the birth rate of two cities, the 
population as a whole of each city is not an appropriate unit. 
Nor is the total female population an adequate unit. The 
best base for the comparison is the number of married women 
of child-bearing age (15-50 or 15-45)- With respect to geo- 
graphic units also, the purpose of the study determines the 
appropriateness of a particular unit. For some purposes, 
such as the computation of the density of population, a city 
block or a square mile may be a satisfactory unit. But if 
these areas contain less than 500 population they would be 
very inappropriate units for the computation of reliable and 
comparable birth, death, morbidity and other vital rates. 
Likewise, the city ward may be a suitable unit for certain 
political purposes, but it is usually very inappropriate for 
purposes of health and social administration. 

2. Clarity. The importance of objective definition of units 
so that they will be used and understood in the same way by 
different people and at different times has been emphasized 
at some length above. The general criterion of clarity is that 
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a unit shall mean the same thing to all people concerned. 
The following illustrations of lack of clarity in units in com- 
mon use will emphasize still further the need of specifying 
exactly the meaning of units employed : 


Unit 
Price 
Age 
Birth 
Accident 


Crime 


Dwelling-house 


Possible Questions of Clarity x 
Retail? Wholesale? At what time? Com- 
ability of quality of articles of same name? 
alue of money at different times? 


On date of enumeration? At last birthday? 
At next birthday ? 


All births, including still births? Live births 
only? Legitimate births only? 


Injury ansing any loss of time? Injury caus- 
ing specified loss of time? 


Felony only? Any offense against the law? 
Any offense causing arrest? Any offense re 
sulting in conviction ? 


A church, if sexton lives in it? A garages if 
chauffeur lives in the second story? Is a 
duplex one or two dwelling-houses ? Hotels, 
apartment houses and dormitories ? 


Does it include attic, pantry, kitchen, base 
ment, bathroom, Closers 


Where a n sleeps? Works? Eats? 
Location of employer? 


Child less than one year? Less than two 
yoman than six months? Less than one 
mon! 


“Is or is not a five-acre market garden a 
farm? ... If a man owns two eighty-ace 
tracts half a mile apart and works them both, 
do they constitute one or two farms? If his 
hired man lives on one eighty [acre tract] does 
that change the status? If he rents one eighty 
{acré tract] to a tenant what is the effect 


How are these problems of clarity to be decided when we 
confront them in an actual investigation? The princip 


W, I. King, Elements of Statistical Method, (Matthillan, 1921). P- 44 
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consideration must, of course, be the purposes we wish to 
achieve. If we are interested,for example, in the cost'of liv- 
ing as reflected in family budgets, we decide to consider retail 
rather than wholesale prices. If, as is usually the case, we 
wish to compare our findings with those of other studies, we 
have to be guided.largely by the definition of units used in 
these other studies or at least by the possibility of translating 
the units into terms of each other. In short, all of the ques- 
tions raised above have to be settled finally on the basis of the 
anticipated or experimentally secured results of using the 
different possible definitions, in relation to the type of result 
we desire. 

Suppose we define crime, for example, so as to include 
traffic violations in a congested area where nearly everyone 
owns an automobile. If on this basis we find the community 
to be highly inclined toward criminal behavior as compared 
with a neighboring slum where few of the inhabitants drive 
automobiles, the result probably does not serve the purpose 
for which the investigation was undertaken. If so, we should 
change the definition of crime so that its application in our 
study will serve our purpose. Ifa certain definition of “case” 
by a social agency, which measures the extent of its activities 
in terms of “cases,” does not apply to three-fourths of the 
clients of the agency, this definition of “case” should doubt- 
less be changed so as to include most or all of its clients. In 
any investigation, units must be adapted to the results we 
wish to achieve. To be sure, the more generally useful a 
unit is the better adapted it is for scientific purposes. But 
units are never inherent in nature or dictated by data. Units 
are merely convenient symbolic designations of aspects of 
phenomena. The assumption that such units as “man” or 
“sheep” are “natural” units because they lend themselves 
readily to a number of general purposes and to counting, is 
quite fallacious. The aspect of “men” and “sheep” in which 
we are interested (e.g., the size of a group. or a herd, ie., the 
bulk or volume of a kind of entity called “group” or “herd”) 
when we use them as units is always implicit in the context. 
This implicitness in the definition of some units as com- 
pared with explicitness of definition in scientific units gives 
rise to an assumed fundamental difference designated by 
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such terms as “natural” (e.g., man, sheep, etc.) vs. “artifi- 
cial” (é.g., degrees, calories, etc.).1° 

When we pass from the problem of defining with clarity 
such concrete terms as those illustrated above to the so- 
called “abstract” or “relative” terms like “normal,” “crime,” 
“pathology,” “marital adjustment,” etc., we are confronted 
with the same problem. We tend to approach these prob- 
lems, however, with less confidence and in a less practical 
way than those discussed above because of a certain feeling 
of mystery and fear lest by an explicit and arbitrary defini- 
tion we miss “the thing itself,” “the real nature of the phe- 
nomenon.” Yet our “feeling” as to the “nature” of the en- 
tity represented by a word is merely the sum total of the 
associations which we have developed for that word. We do 
not deny the legitimacy or the importance of these associa- 
tions, including the vague and subtle ones which may be 
held only by ourselves. It is only by explicitiy specifying 
these associations, however, that we can make clear to others 
just what meanings and associations we do attach to a word. 
If these meanings are very varied, we may find it in the in- 
terest of mutual understanding to agree to apply the original 
word only to certain well-defined aspects of the phenomenon 
under consideration, and to adopt other terms to cover the 
excluded associations. Or we may leave the original word 
as a non-scientific catch-all symbol for everybody's associa- 
tions with a word and substitute a new word for the special 
aspect in which we are primarily interested. 

Proceeding in this manner, we may recognize at least two 
meanings of the word “normal.” Very frequently it is per 
haps used to indicate a state of affairs which the user regards 
as desirable, or even ideal, as, for example, when he con- 
siders his health normal. Such a definition would obviously 
be relative to each individual, and would vary greatly 1? 
content from person to person. A more objective definition 
of “normal” is, therefore, to define it as simply the usual or 
ordinary for a given group as measured by a statistical moee, 
or the variations within, say, one standard deviation above 
and below the mean. Subnormality and supernormality 
would then represent any variation respectively below an 


19 For elaboration of this subject, see my Foundations of Sociology, PP- 68-79 
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above this central or most common group. This assumes 
that a scale indicating degrees of the quality under’ consid- 
eration (intelligence, ability, industrial production, marital 
adjustment, or anything else) has been constructed and that 
the data have been classified according to this scale. “Pa- 
thology,” “crime,” “insanity,” “maladjustment,” and other 
such words are then definable as a specified degree of varia- 
tion (say the lower two percent) in the socially disapproved 
direction. Corresponding degrees of variation in the socially 
approved direction are called “genius,” “distinguished,” etc.*? 
The fact that scales for the measurement of such phenom- 
ena are evaluations relative to some culture does not affect 
their logical foundation or utility as instruments for dis- 
criminating and categorizing the responses of men to differ- 
ent degrees of an aspect of a phenomenon. In this sense all 
human responses are “evaluative.” That is, we “value” the 
weight of one article as more than that of another, and in- 
struments exist for the more accurate discrimination and 
classification of these “valuations.” Likewise, we “value” 
one liquid as sweeter, cooler, or darker than another. Thus, 
“valuating,” which is frequently supposed to be a unique 
feature of our responses to social phenomena, is now seen 
to consist of nothing but discriminations (differential re- 
sponses) which we also make toward all physical phenomena. 
It is true that these value scales of physical qualities usually 
are much more universal and interculturally valid, thanks 
largely to the diffusion of standardized instruments for physi- 
cal measurement. But in the end, all such judgments of 
degrees of physical qualities, whether with instruments or 
without, are relative to the sensory equipment of man. So- 
cial valuations and social scales are likewise relative to this 
sensory equipment as conditioned by group living in a given 
culture. It is not necessary to argue that a tape measure and 
an attitude scale are the same in any other respect than as 
instruments through which it is possible to discriminate re- 
liably significant gradations in situations to which we have 
to adjust. This process is usually called measurement. 
i i ject, S. C. Dodd, Dimensipns of 
ioe feet ogg een age oe poi G. Lundberg, “Societal Pathology 
and Sociometry,” Sociometry, 4:78-97, 1941- 
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3. Measurability. To say that measurability is a desirable 
characteristic of scientific units is then merely another way 
of designating a relatively high degree of clarity in defini- 
tion. The definition of the valuative, subjective, intangible 
qualities of social phenomena is to be solved as the same . 
problem has been solved in the other sciences. Height, 
weight, temperature, etc., are defined for scientific purposes 
only in terms of the instruments we have devised to measure 
those aspects of situations to which we need to make increas- 
ingly refined adjustments. A mature social science will like- 
wise define intelligence, morale, status, maladjustment, crime, 
etc., in terms of the stantlardized instruments by which we 
measure certain aspects of situations which confront us. There 
will be no mystery as to exactly what segments or aspects of 
behavior ate taken into consideration in the calibration of 
these instruments. Neither will there be any denial of the 
possible importance of other aspects which these instruments 
neglect, and which will therefore be legitimate objects of 
further construction of instruments. The technique of this 
construction will concern us in a later chapter. 

We have pointed out that in science we take measurement 
to be merely a more specific and refined way of responding 
to events. Measurability, therefore, is regarded not as a pe 
culiar quality or characteristic possessed by some phenomena 
but not by others. We regard measurability instead as 4 
technical problem of devising the necessary instruments for 
categorizing our responses with that degree of reliability and 
objectivity which distinguishes measurement from other types 
of response. Definition and measurement tend in science to 
involve the same operations. That is, measurement is a Way 
of defining by describing, and distinguishing accurately, just 
as the most accurate definition tends to be in terms of meas 
urements. Since Einstein, at least, physicists have been con- 
tent to define time as that which is measured by clocks, and 
to say that force is that which makes pointers move across 
dials, etc. Previously physicists, too, were of the impression 
that they must define, describe or “know” what these things 
“are” in some more final sense than that which their meas- 
urement yields. Following out this notion they constructed 
ponderous paragraphs of familiar words whose reassuring 
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sound seemed to provide the necessary definitions. When 
they faced the concrete problems of their science, however, 
it was found that these definitions contributed nothing to 
the solution. All that was found useful was the measure- 
ments of the entities so loquaciously defined. t 

Ordinarily we limit the term measurement to those de- 
scriptions which we can state in numbers of some kind. We 
strive in science, therefore, to state our observations in units 
which, avhen counted, represent a reliable and useful ac- 
count of the event under observation. 

The obstacles to measurement alleged to inhere in the 
nature of the subject-matter of the social sciences are merely 
the difficulties which inhere in the type of units into which 
that subject-matter is at present ‘divided. They are not con- 
cretely observable units, but are assumptions and abstrac- 
tions drawn from philosophy and metaphysics. The general 
criterion of objective measurability of a unit is the possibil- 
ity of securing similarity of responses to it from most or all 
people. The “strength of religious interest” in an individual 
is not “directly” measurable. But his religious behavior (in- 
cluding his symbolic responses) — his church membership, at- 
tendance, financial support, and other activities engaged in, 
or abstained from, for religious reasons — is objectively meas- 
urable. When methods of accurately describing these types 
of behavior in terms of numerical degrees are devised the 
task of measuring them has been solved. 

4. Comparability. Facts are of significance only in rela- 
tion to other facts. It is of primary importance, therefore, 
that units be not only clearly defined, but that care be exer- 
cised lest the definition change over periods of time within 
which it is desired to make comparisons. A vast amount of 
error in statistical studies in the social sciences flows from a 
failure to observe this criterion. Comparisons of wages, 
wealth, and income in money units which vary widely in 
purchasing power between the periods and places compared 
are common. Numerous attempts have been made to dis- 
cover trends in social maladjustment through comparative 
figures of divorces, arrests, convictions, etc., without taking 
into consideration the changes in laws, and law enforcement 
during the period and in the localities considered. The Fed- 
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eral Census itself has changed its ‘definition of some units 
from decade to decade, e.g., such units as family, city, farm, 
establishment, age, etc. Unless allowance is made for these 
changes in definition, conclusions drawn from a comparison 


of the figures are bound to be fallacious. It is undoubtedly . 


necessary to change the definitions of units from time to 
time, although the resulting incomparability of apparently 
comparable figures is likely to result in considerable incon- 
venience or, more seriously, in false conclusions. If the units 
in each case have been defined in objective and measurable 
terms, however, the problem is less serious because the differ- 
ence between the old and the new units can then be deter- 
mined and the necessary adjustments made. For the general 
criterion of comparability is that the symbols representing 
the units to be compared should in all significant respects 
correspond to the aspects of phenomena which we wish to 
compare. Statistical devices such as percentages, ranks, index 
numbers, standard deviation units, and the components of 
factor analysis are good illustrations of instruments designed 
to render comparable similar aspects of widely diverse sub- 
ject-matter. 


H. CLASSIFICATION 


In the above discussion of the definition of terms and 
units, the general principles of scientific classification have 
already been set forth. A definition of any kind is, of course, 
only a kind of classification. The fundamental reason for 
classification is that science is invariably concerned with dis- 
covering and describing uniformities in large numbers of 
phenomena. And the process of determining these uniform- 
ities consists of separating the whole mass of phenomena 
into categories on the basis of their likenesses and differences- 
Certain characteristics of an object are selected as significant 
for certain purposes. Objects possessing these characteristics 
are classified together ; objects which do not possess the sig- 
nificant characteristics are excluded from the class. These 
likenesses and differences may be quantitative or qualitative 
—in terms of “place, time, circumstance, quality, magnitude, 
activity, behavior or function, coexistence or sequence.” 


nF. H. Giddings, “Societal Variables,” Jour. Soc. Forces, 1946, 198$ 
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The number of classifications which can be made out of 
almost any body of'phenomena on these bases is exceed- 
ingly large. What characteristics we shall use as the basis 
of classification is always to be determined by the end 
sought. “The test of the scientific character of a classifica- 
tion,” says Westaway, “is the number and importance of the 
properties which can be regarded as common to all the ob- 
jects included in a group.” ° But the importance of a prop- 
erty or an attribute will be governed by the purpose we have 
in view. 

For example, in a housing survey, we could classify houses 
according to color, and secure a fairly complete classifica- 
tion. But both the number and importance of statements 
applicable to the resulting groups would be negligible. Like- 
wise, in a study of the death rate in a population, a classifica- 
tion of that population as blondes and brunettes would be 
practically useless. The classifications which we need in this 
connection are age, sex, nationality, race, etc. “Te often hap- 
pens that natural groups must be founded not on the more 
obvious but on the less obvious properties of things, when 
these are of greater importance; and this suggests what is 
certainly the fact, that an extensive knowledge of the proper- 
ties of objects is always necessary for making a good classifi- 
cation of them.” 2 The difficulties of adequate classification 
should be ‘fully recognized. But they are difficulties which 
are not insuperable, once we abandon the attempt to make 
classifications which will hold for all purposes, and recognize 
all classifications as valid only for specific purposes. “Per- 
formed consciously or unconsciously, the act of classification 
is indispensable to, and accompanies every scientific infer- 
ence. A mind is orderly or slovenly according as it does or 
does not habitually and accurately classify the facts with 
which it comes in contact. ‘The success of an investigation, 
the worth of a conclusion, are in direct proportion to the 
fidelity to this principle and the exhaustiveness with which 
the process is carried out.” ** 

The formal rules for valid classifications may be summar- 

22F, W. Westaway, op. cit, p. 215. 


23 Ibid., p. 216. - AAY 
“Frank Cramer, The Method of Darwin: A Sttidy in Scientific Method, 


McClurg, 1902, p. 98. 
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ized as follows: (1) Classifications must be mutually exclu- 
sive. A classification certainly loseayp usefulness if any of 
the data to be classified can properly be placed in more than 
one category. This means that there must be only one basis 
for division at a time, and that the basis must be as objective 
as possible. The attempt to classify human behavior on the 
basis of interests, wishes, or other subjective categories, for 
example, is futile because a given behavior may be attrib- 
uted to a number of different assumed desires or interests. 
(2) Closely related to the principle of exclusiveness is the 
principle that a valid classification must be exhaustive, that 
is, include all the data studied. Thus a classification of popu- 
lation as undef twenty-one years and twenty-one and over 
would be exhaustive, but a classification on the basis of white 
or Negro might leave a large portion unclassified. (3) Finally, 
a classification must be appropriate, that is, suited to the end 
sought. Schoolboys might be classified on the basis of the 
number of syllables in their names. But it would probably 
serve no useful purpose. 

The skillful classification of data is an extremely difficult 
and important step in the scientific method. It calls for all 
the imagination of the artist and the philosopher as well as 
intimate knowledge of the data to be classified and of the - 
rules of logic and mathematics governing formal classifica- 
tion It is therefore not a subject to be adequately dealt 
with in any textbook, but is rather to be attained by experi , 
ence and experiment with actual data. We must limit our- 
selves here to a simple example of the type of considerations 
and problems that arise in such a practical and common proj- 
ect as the classification of occupations. For more adequate 
treatment of the subject, the student is urged to consult some 
of the sources mentioned at the end of the chapter. 

In 1938, the American Statistical Association and the Cen- 
tral Statistical Board appointed a committee to prepare a list 
of occupational titles under which the entire working popu- 
lation of the nation could be classified. Some of the prob- 

% For a good example of formal classification and a discussion of its uses, 
see Karl Menger, “An Exact Theory of Social Groups and Relations,” Amer. 


Four. Sociol., 43:797-98. 1998. See also, G. A. Lu „, “Some Problems of 
Group Classification and Kiore,” Amer. Sociol” Rev. §°351-60, 1940- 
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lems and questions that arose are indicated in the following 
extracts : 7° Ar , 


_ The general point of view from which the Committee worked 
is that an occupational classification is fundamentally an instru- 
ment for dividing a working population into groups which are 
more or less homogeneous with respect to the occupations in 
which they are or have been engaged. Such groups are often used 
primarily as a means of describing a population, either for statis- 
tical purposes (as in the case of the Bureau of the Census), or for 
administrative purposes (as in the case of a placement office). 
More frequently they are used for purposes of statistical analysis, 
in which occupation is one of a number of variables. The typi- 
cal analytical problem of this kind is concerned with some social 
or economic variable, such as employment, unemployment, in- 
come, morbidity, mortality, fertility, education, and mobility. In 
such problems occupation is one of a number of descriptive 
items normally ascertained, others being sex, color, age, residence, 
and place of origin. In general, the purpose of including occupa- 
tion is to test or measure the relationship between the part which 

ople play in the processes of production and distribution and 
their biological, social, and economic characteristics. 

In order to avoid circular reasoning in such studies, it is clear 
that the occupational classification must be based on occupational 
criteria as far as possible rather than on other criteria—that it 
should be determined by the nature of the work done and should 
be quite independent o other variables. For instance, in analyz- 
ing the relation between occupation and the incidence of indus- 
trial accidents, it would weaken the argument to use an occu- 

ational classification which in itself was based in part on the 
incidence of accidents [e.g., “hazardous” occupations]. Similarly 
in analyzing the relation between occupation and income it would 
be questionable to use an occupational classification which di- 
rectly rested on an income criterion. . . 

Deciding on the degree of detail to be obtained through its 
occupational classification scheme was a second important prob- 
lem facing the Committee. It is an elementary fact that the final 
factor limiting detail is the precision of the source data, for exam- 
ple, whether the primary returns distinguish clearly the “welder” 
who is scarcely more than a machine operator from the “welder” 
who is a highly skilled craftsman. . . 

In the opinion of the Committee, the dangers of misclassi- 
fication as a result of using occu ational classification schemes 
‘more refined than the data will justify, are very considerable ; 
and the damage cannot always be repaired subsequently by rear- 


26 P. K. Whelpton and Edward Hollander, “A Standard Occupational and 
Industrial Classification of Workers,” Soc. Forces, 18:488-94, 1940- 
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rangement or combination. Accordingly the list of occupations 
prepared by the Committee is keyed tofa level of precision of 
source data that can reasonably “be expected from a household 
enumeration in which most of the data are obtained from persons 
other than the worker himself, but closely associated with im... 

If a working population is to be classified both occupationally. 
and industrially it is recommended by the Committee that in 
principle each occupation be treated as existing in several indus- 
tries rather than as being confined to a single industry. Under 


this procedure all telephone operators are not classified under the + — 


telephone industry, where the majority work, but are distributed 
among the department stores, railroads, airplane factories, and 
other industries employing them. Exceptions to this plan are 
advantageous in the few cases where the inclusion of industrial 
terminology in an occupational title restricts that occupation to 
a particular industry. For example, the occupation “College 

residents, professors and instructors” can be found only in the 
industry “Educational services,” and the occupation “Servants, 
private family” can be found only in the industry “Domestic 
service. . .” 

In applying an industrial classification to employees of a 
diversified organization an important question is the relative con- 
sideration which should be given to the activity at the place of 
work and to the main activity of the organization. For example, 
should miners in a coal mine and nurses in a hospital owned and 
operated by a steel ie oe be classified industrially under the 
titles “Coal mining” and “Iron, steel and their products” ? Care- 
ful study of the problems involved led to the conclusion that it is 
more significant and useful in such cases to base the decision on 
the activity at the immediate place of work — the coal mine and 
the hospital in the above cases. On the other hand, it ap ared 
to the Committee that workers in an auxiliary establishment 
which has an integral technical or servicing relation to the major 
activity of a parent industry — as the wooden pattern shop of a 


machinery factory or the or warehouse of a department 


store —should be classified under the major activity. 

A somewhat different problem arises in classifyin employees 
working at establishments not operated by their eagle ers. For 
example, in which industry should a demonstration girl workins 
at a department store (retail trade industry) but on the pay rol 
of a cosmetic manufacturer (chemical industry) be classified ? 
Similarly, in which industry should a musician playing in an 
orchestra at a hotel restaurant (hotel industry) but employed by 
the orchestra conductor (recreation and amusement industry) be 
classified? Such questions are difficult to answer, but the ma- 
jority of the Committee believe that the employeremployee rela- 
tionship should be the determining factor. On t ese bases, 
“industry of employer” should take precedence over “activity at 


bi 
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immediate place of work” in the industrial classification of em- 
ployees working at establishments not operated by their em- 
ployers. ; $ 


In addition to such practical problems in classification, 
science sometimes demands classifications so comprehensive 
as to provide a place for all the phenomena in a given field 
of study. Thus, Carl von Linné (Linnaeus, 1707-78), the 
Swedish botanist, developed a system of classification for 
every known animal and plant. Briefly, his system consisted 
of placing any specimen first in a class, then in an order, then 
in a genus, and finally in a species.” This system, with mod- 
ifications, is still the basis accepted by all naturalists. The 
symbols for designating genus and species (e.g, Canis fami- 
liaris, the domestic dog) are, furthermore, found in the text- 
books of all nations, thus eliminating the quarrels and mis- 
understandings arising from difficulties of translating the 
exact meanings of words. The number of classifications re- 
sulting from this system is, of course, extremely large, Thus 
Wheeler says that there are fully 10,000 species of social in- 
sects alone, each of which may be said to have its own pe- 
culiar pattern of social behavior. This is a fact which might 
be considered by those social scientists who feel that the mul- 
tiplicity and variety of social phenomena is especially great 
and hence’ prohibitive of systematic treatment. 


I. SUMMARY AND CONCLUSION 


The limitations of the human organism compel it to re- 
spond to a limited number of things at any one time. ‘The 
resulting parts or units represent in no sense a “discovery” 
on our part of any inherent division in the whole. These 
parts are merely constructs of our own making— new ways 
of reacting to a situation. The only test of their validity is 
their utility in helping us to achieve whatever purpose we 
are seeking to accomplish. But because of the organization 
of our thinking from earliest childhood in terms of socially 
* 2 For a brief reference to Linné’s work, see F. W. Westaway, The Endless 
Quest : 3000 Years of Science, (Hillman-Curl, Inc., New York, 1936), pP- 735° 
37- The tremendous importance of a classification like Mendeléeff's Periodic 


Table of Chemical Elements should also be mentioned in this connection. 
See Ibid., pp. 442-45. for a brief discussion. 
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accepted units and classes, there is a tendency to overlook the 
fact that there is a socio-cultural, habitual aspect always in- 
herent even in the most “natural” classes. We admit the con- 
tinuity of nature in the abstract, but tend to forget this fact 
in our habit of looking at the universe in parts, which is the 
only way we can deal with it practically. As a result, the fun- 
damental “artificiality” and indefiniteness of all units and 
classes are frequently overlooked. But the so-called “arti- 
ficiality” and indefiniteness of all units and classifications 
does not make them any less important as devices through 
which we make our adjustments to the world. 

The present chapter has attempted, first, to illustrate the 
necessity of a uniform terminology in any field of inquiry 
which men wish to discuss intelligently, or which they aspire 
to bring within the domain of science. Second, it was shown 
that such uniformity is possible in the highest degree only 
through the objectification of the terms and concepts em- 
ployed so that they can be measured by means of mechanical 
devices outside of the organism, which were described in the 
preceding chapter. Third, some of the shortcomings of the 
terminology, the units, and the classifications of the social 
sciences with reference to uniformity and objectivity were 
discussed and illustrated. 

A remedy for this situation was next outlined. Certain 
general criteria of a scientific term or concept were laid down. 
We emphasized the danger of assuming that the terms 
adopted represent any final or inherent delimitation of na- 
ture such as is implied in expressions like “the very nature 
of the thing,” “the ultimate reality,” “the thing in itself,” 
and so forth. The morass into which such assumptions about 
terminology have led us in the fields of psychology and so- 
ciology was discussed at some length. One aspect of this 
difficulty was illustrated by the controversy in sociology over 
the alleged greater “reality” of the concept “individual,” as 
compared with the concept “society.” The general confu- 
sion on the subject of the relationship of the whole and its 
parts were shown to be the result of a general misunder-» 
standing of the nature of conceptual analysis. Finally, the 
concrete requirements of satisfactory units and of classifica- 
tion generally were discussed and illustrated. 
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J. SUGGESTIONS FOR FURTHER STUDY 


as Supplementary reading for elaboration of the subjects 
of this chapter will be found in the author’s Foundations of 
Sociology, Chaps. 2, 3, 7, and 9. 

2. On operationism and operational definitions, the fol- 
lowing literature, in the order given, will be found of inter- 
est : 


a. H. Alpert, “Operational Definitions in Sociology,” 
Amer. Sociol. Rev., 3:855-61, 1938. 

b. P. W. Bridgman, The Logic of Modern Physics, (Mac- 
millan, 1932), Chap. 1. 

c. H. Hart, “Operationism Analyzed Operationally,” 
Phil. of Sci., '7:288-313, 1940. 

d. S. S. Stevens, “The Operational Definition of Psy- 
chological Concepts,” Psychol. Rev., 42:517-27, 1935- 

e. S. S. Stevens, “The Operational Basis of Psychology,” 
Amer. Jour. Psych., 47:323-30, 1935- 4 À 

f R. B. Lindsay, “Critique of Operationalism in Phys- 
ics,” Phil. of Sci., 4:456-70, 1937- 

g. G+ A. Lundberg, “Operational Definitions in the So- 
cial Sciences,” Amer. Jour. Sociol., Jan. 1942. 


3. A brief summary of S. C. Dodd’s work on the definition 
and classification of sociological concepts will be ‘found in 
“A System of Operationally Defined Concepts for Sociology,” 
Amer. Sociol. Rev., 4:619-34, 1939- 

4. On the theory and practice of formal classification, see 
K. Menger, “An Exact Theory of Social Groups and Rela- 
tions,” Amer. Jour. Sociol., 43:790-98, 1938. Also G. A. Lund- 
berg, “Some Problems of Group Classification and Meas- 
urement,” Amer. Sociol. Rev., 5:351-60, 1940. Regarding 
practical psychological aspects of classification, see P. F. La- 
zarsfeld, The Technique of Marketing Research, (McGraw- 
Hill, 1937), Chap. 11. 

5. For practical experiments in the classification of social 
groups, see Dwight Sanderson, “Group Description,” Soc. 
Forces, 16:309-19, 1938. Also “Preliminary Group Classi- 
fication Based on Structure,” Soc. Forces, 17:1-6, 1938. 
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6. For an excellent brief discussion of methods of classi- 
fication in the physical and biological sciences, see V. F. 
Lenzen, “Procedures of Empirical Science,” International En- 
cyclopedia of Unified Science, Vol. 1, No. 5, pp. 31-35. 
Univ, of Chicago Press, 1938. 


» CHAPTER IV 


THE PRINCIPAL METHODS OF 
SOCIAL RESEARCH 


We do not speak of qualitative versus quantitative analysis. We do not 
seek to prove that one type should predominate over the other. Instead of 
dogmatizing about method at large, we are experimenting with methods in 
detail, In the measure of our proficiencies, we all practice both qualitative 
and quantitative analysis, shifting our emphasis according to the tasks we 
have in hand. And we are all eager to see our colleagues develop both types 
is ARAA the limits of efficiency in the tasks which they essay. — Wesley 

. Mitchell4 


A. THE SIMILARITY OF DIFFERENT METHODS 


The term science has been used in this book to designate 
the method of procedure which is today accepted as the 
method of natural science? That is, it has been employed 
to refer only to those disciplines which engage in the objec- 
tive classification of their data for the purpose of reducing a 
wide range of observed phenomena of behavior to a brief, 
generalized statement or formula. This is not, of course, 
the only useful type of systematic accumulation of knowl- 
edge. We may be interested merely in discovering and set- 
ting forth specific temporal or spatial events or sequences, as 
in history and geography. These disciplines are sometimes 
called descriptive sciences, but since all science is descriptive, 
such a distinction is of doubtful value. While the terms “‘his- 
tory and geography” are usually thought of as primarily de- 
noting subject-matter, they also imply a particular method. 
This method is usually thought of as differing from the 
method of natural science in that the method of history con- 
cerns itself chiefly with a recording of each unique datum 
occurring in the immediate or remote past, rather than with 
the quantitative classification of data for purposes of gen- 
eralization. This distinction, however, it will be seen on 


1 “Quantitative Analysis in Economic Theory,” Amer. Econ. Rev., 15:1, 


1925. 
fhe term “natural science” is here used in its commonly asepe mean- 
ing, as it applies to sciences like chemistry, physics and others. Of course all 
science is “natural” and social phenomena are as much so as the phenomena 
of chemistry or physics. 
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further investigation, is one of the degree to which a process 
is carried out rather than a difference in kind. The original 
observations of natural science are also observations of ap- 
parently unique events in the order of occurrence. But nat- 
ural science does not stop at this point. It proceeds to 
ignore some of the unique aspects of its data in making classi- 
fications according to certain similarities. Science always 
abstracts certain specific elements or aspects of a total reality 
and analyzes them from the particular angle which is rele- 
vant to the immediate purpose. From these classifications of 
similarities, uniformities, and sequences it makes its general- 
izations. Thus, it merely carries further a process which in 
its early stages is in all important respects like the method of 
history. Pt 


B. THE HISTORICAL METHOD AND NATURAL SCIENCE 


Even strictly historical method, however, tends to merge 
gradually and more or less objectively into the method of 
natural science. To the extent that historians go beyond the 
mere recording of facts in their temporal sequence, and be- 
come “comparative historians,” that is, attempt to reduce 
their data to types, and to generalize from them, history be- 
comes a natural science. Since there are usually present in 
the works of all historians such groupings of facts and gen- 
eralizations more or less objectively carried out, it is clear 
that no arbitrary line can be drawn between natural science 
and other disciplines primarily historical in character. They 
tend to merge gradually into each other and differ chiefly in 
the degree to which they carry out their processes, and in the 
objectivity of their procedure. The same may be said of the 
crude common-sense method of the child or of the pro- 
verbial “man on the street.” As soon as a child begins to 
experience likenesses and differences in his responses to his 
surroundings, he is engaged in observation and classification, 
however rudimentary the process may be in the conventional 
terms of scientific method. These similarity-and-difference 
responses will probably be in terms of number, size, shape, 
speed or intensity of objects, characteristics, or events. In 
making responses in such terms he is engaged in a rudimen- 
tary form of measurement or enumeration. From the fre- 
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quency of recurrences, similarities, or sequences thus experi- 
enced, every normal person generalizes more or less broadly. 
All that science adds to this common-sense procedure is ac- 
curacy and objectivity. 

The purpose of pointing out at some length this essential 
similarity of different methods, is to avoid much futile con- 
troversy over the desirability of various methods at present 
employed in the social sciences. To point out the greater 
completeness, objectivity, and accuracy of the methods of 
natural science is not to deny altogether the value of less 
complete and less accurate procedures. Tentatively, at least, 
even the results of entirely random and subjective inquiry 
may have some value in suggesting problems and affording 
clues to solutions. Pure chronologies of events, even if they 
are in such form as to preclude the possibility of ever deduc- 
ing scientific principles from them, may be of great practical 
value for certain purposes and essential to intelligent be- 
havior with reference to particular situations. This is true 
of history as a pure record of events and of case histories of 
an individual, a group, or an institution. An accurate knowl- 
edge of the past of each may enable us to predict their future 
behavior with a high degree of accuracy. But on what is our 
reliance on such prediction based? Clearly, it can be only 
our knowledge of certain sequences in a number of cases 
which lend any validity to such predictions. (See Chap. I, 
Sec. J.) The degree to which these individual, records be- 
come analytical, comparative and statistical will determine 
their value for accurate prediction, which is the chief stand- 
point from which they have any scientific value. In anthro- 
pology this transition toward the method of natural science 
is fully recognized. Thus Boas defines anthropology as “the 
science that endeavors to reconstruct the early history of man- 
kind, and that tries, wherever possible, to express in the form 
of laws ever-recurring modes of historical happenings.” * 


(Italics ours.) à 
From this definition it would obviously be futile to argue 


3Franz Boas, Anthropology. Columbia University Lectures on Science, 
Philosophy, and Art, spar p. 8. See also Clyde Kluckhohn, “On Certain 


Recent Applications of Association Coefficients to Ethnological Data,” Amer. 
Anthropol AMBI. 1939. Also A. Lesser, “Research Procedute and Laws 


of Culture,” Phil. of Sci., 6:482-95, 1937- 
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whether anthropology is a natural science. It depends on 
the degree to which a particular worker devotes himself 
primarily to the “reconstruction of the early history of man- 
kind” or to the formulation of laws of “ever-recurring modes 
of historical happenings.” If he is primarily engaged in the 
latter activity he is engaged in natural science. But a rather 
complete accumulation of the historical facts is necessary, as 
is the case in every scientific discipline, before this method 
becomes possible. There is no question, therefore, of the 
relative value of the two methods of procedure. They are 
different stages in the same attempt to understand, explain, 
predict, and control cultural events.* 


C. COMPLEMENTARY NATURE OF DIFFERENT METHODS 


This view of different methods as different stages in a com- 
mon process should serve to set at rest much futile contro- 
versy over the superiority or inferiority of different methods 
in the social sciences, and especially in sociology. Each has 
its place, and, for a particular purpose, or at a particular stage 
of investigation, is best. Instead of speaking of the historical 
method, the “case” method, the comparative method, the 
statistical method, etc., as mutually exclusive and even antag- 
onistic, we should recognize that these methods are all legiti- 
mate stages in human adjustment-technique, the completest 
form of which we call natural science. The fallacy in con- 
troversies over the value of these different methods lies in 
regarding each as independent, sufficient, or complete in it- 
self, Any method which achieves its purpose is valid for that 
purpose. Our methods should change as they are adapted 
to different ends or different stages of development of a field 
of inquiry. Even theological and metaphysical methods of 
approach, to use two of Comte’s stages, have their value in 
certain stages of development of any field of knowledge. 
They may furnish working hypotheses and they furnish a 
tentative form of adjustment. Historical data and general- 
izations, of course, serve in the same way, but may also serve 

4The works of James Harvey Robinson, James Shotwell, Charles A. Beard, 
Harry Elmer ‘Barnes, and Howard Becker should be especially noted as illus- 
trative of the contribution of the historical method to sollo e 


7 gy. See espe- 
cially, Barnes, Becker, and Becker, Contemporary Social Theory, (D. Appleton: 
Century Co., 1940), Chap. 15. 
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as'case material for statistical classification and generalization 
according to the methods of natural science. In fact, from 
one point of view, all the raw material of natural science is 
historical, i.e., it represents events which have taken place in 
the past. From the standpoint of verification, corroboration 
and objectivity of statement, their usefulness varies greatly 
according to the remoteness of their occurrence, the accuracy 
with which they are observed and the form in which they are 
recorded. Since this form in turn depends largely on the 
purpose in mind at the time of observation, it is often very 
difficult to use a body of facts compiled for purely historical 
purposes as data for natural science. The selective process 
in writing history renders much historical material of doubt- 
ful value for scientific purposes both on account of its neglect 
of some of the data, and the non-quantitative statement of 
those that are recorded. These considerations apply of 
course to all of the so-called life history documents — letters, 
diaries, biographies, etc.—as well as to more formal histori- 
cal documents. 

But scientific observation also consists of selecting from an 
innumerable list of characteristics and relationships certain 
arbitrarily chosen aspects of clearly defined objects for re- 
cording, and excluding all other aspects. The limitations 
of our mechanisms of response and symbolization (i.e., our 
senses and language symbols) make this unavoidable. Among 
these limitations is the inability of the senses to grasp more 
than a comparatively few facts at a time. This makes it nec- 
essary for us to select for consideration only certain charac- 
teristics from the multitudinous phenomena which constitute 
the universe. Out of the things selected we make scientific 
facts, ie., generalizations. This necessity for selection gives 
rise to a question of fundamental importance to science, 
namely: How shall we determine in a particular investiga- 
tion which factors to select and which to ignore? 


v D. THE NATURE AND FUNCTION OF WORKING HYPOTHESES 


When we are confronted with a concrete everyday prob- 
lem we answer this question by taking note only of those 
elements which we think are significantly related to our 
problem. What we think is significant, however, will be de- 


ca 
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termined by our previous experience. That is, we will pro- 
ceed to look for factors which were significant in other similar 
problems with which we have been confronted. The sim- 
ilarity may be great or small. If our situation or problem 
is so strange that we are unable to compare it at all with 
other situations or problems to which we have been exposed, 
we are either unable to make any adjustment to it at all, or 
our adjustments will be of a blind trial-and-error type. The 
assuinptions which we do make regarding the significant fac- 
tors in a situation or problem become our hypothesis, on 
the basis of which we begin to make special note of the data 
which tend to bear out or contradict our hypothesis. In the 
everyday world, therefore, we usually construct hypotheses 
as to the cause and solution of all our problems on the basis 
of analogy — the assumption being that, because two things 
resemble each other in one or a number of points, they there- 
fore probably resemble each other in some other point or 
points which are significant for our purpose. 

It is because this assumption is so frequently used as a 
conclusion and a proof, instead of as a hypothesis to be inves- 
tigated, that analogy is in such bad repute as a scientific 
process. Analogy is useful and defensible only when points 
of difference as well as of similarity are noted. Again, it is 
not so much the numerical preponderance of similarities or 
differences which are of importance, as their significance for 
the purpose at hand. When properly used, analogy is, not 
only in everyday experience, but in science as well, our most 
fruitful source of hypotheses. 

A working hypothesis is necessary before the intelligent 
collection of data can begin. Even a general collection of 
data like the Federal Census has behind it as an hypothesis 
the assumption that the type of data which has been found 
useful in the past will be useful in the future. Without an 
hypothesis we should have to set out to record all our re- 
sponses to environmental stimuli, i.e., all of the perceptions 
of our senses. But as has already been pointed out in a previ- 
ous chapter, what our senses respond to is determined largely 
by what they “look for.” Whether we intend it or not, 
therefore, there will be a selection of data, on the basis of 
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our previous experience. This is all that a working hypoth- 
esis is—a clearly recognized assumption, based on previous 
experience, of what are probably significant factors in a sit- 
uation. The only difference between gathering data with- 
out an hypothesis and gathering them with one is that in the 
latter case we deliberately recognize the limitations of our 
senses and attempt to reduce their fallibility by limiting our 
field of investigation so as to permit a greater concentration 
of attention on the particular aspects which past experience 
leads us to believe are significant for our purpose. Even if 
we started out by recording all of our responses, we should 
be compelled to make a selection from the mass so recorded 
before we could use these observations for scientific pur- 
poses. Such a selection would have to be on the basis of | 
some hypothesis. The simpler and more satisfactory way, 
therefore, is to make the original observations on the basis 
of the chosen hypothesis or hypotheses. 

The working hypothesis, then, is a tentative generalization 
based on a cursory, empirical examination of certain data. 
But it may be merely a “hunch” or an “intuitive” supposi- 
tion. Fundamentally, of course, all “hunches,” “intuitions,” 
and hypotheses rest upon an informal, or even “‘subcon- 
scious,” inductive basis. An hypothesis is a tentative gen- 


eralization based on data so vague or inadequate that it is 
fot at the time demonstrable in objective terms. Here, 
again, of course, there is much difference of opinion when an 
hypothesis ceases to be an hypothesis and becomes a scien- 
tific theory or a scientific law. This question turns on the 
adequacy of the data on which the generalization is based, 
which, within limits, will be a riatter of opinion. Usually 
the term “theory” is used to designate a middle ground be- 


the hypothesis, and the fully substantiated and demonstrable 
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E. DANGERS IN THE USE OF HYPOTHESES 


The working hypothesis is, in its simplest form, merely a 
definition of the object of an investigation. As such, it limits 
the problem to proportions, which can be handled with the 
resources at hand, and prevents an aimless and random gather- 
ing of data. It guides in the selection of data. Herein, of 
course, lies a grave danger. If the investigator conceives of 
his hypothesis as a proposition to be defended to the exclu- 
sion of evidence that might support the opposite, or another 
hypothesis, such an approach of course vitiates his work from 
the scientific standpoint. “A working hypothesis,” says Chad- 
dock, “in the scientific sense, is a theory or an explanation 
held after careful canvass of the known facts, in full knowl- 
edge of other explanations that have been offered, and with 
a mind open to a change of view if the facts disclosed by the 
inquiry warrant a different explanation. It is, therefore, 
held with the definite purpose of including in the investiga- 
tion all available and pertinent data, either to prove or to 
disprove the hypothesis. An hypothesis of this character is 
usually desirable and even essential. It gives point to the 
inquiry, and, if founded on sufficient previous knowledge, 
guides the lines of the investigation. Without it much use- 
less data may be collected in the hope that nothing essential 
will be omitted, or important data may be omitted which 
could have been easily included if the purposes of the in- 
quiry had'been more carefully defined. Blind gathering of 
masses of data does not usually lead to the discovery of unex- 
pected relations between facts or result in new explanations.” * 

The disappointment of some students when they discover 
that the facts do not bear out, or contradict, an hypothesis, 
betrays an unscientific attitude. From the purely scientific 
viewpoint, a scientist's contribution is equally great whether 
he proves or disproves an hypothesis. Emotional attachment 
to a preconceived conclusion must always be avoided, for it 
militates against that impartial observation and collection of 
the data, on which depends the ultimate validity of the con- 
clusions, as well as the usefulness of the other steps in the 


5R. E. Chaddock, Principles and Methods of Statistics, (Houghton Mifflin, 
1925), P- 372. 
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prove their hypothesis are admitted. Under such conditions 
we have the pathetic spectacle of people who started out as 


an hypothesis no longer supported by the facts. s 
F. CLASSIFICATION OF METHODS OF COLLECTING DATA 


Methods of gathering data should be distinguished from 
methods of manipulating, summarizing, generalizing, or 
otherwise handling data already collected. The present vol- 
ume is concerned chiefly with the former task. It is true that 
the possible ways of handling data after they are collected is 
largely determined by the method employed in their collec- 
tion. Therefore our technique of collection must be gov- 
_ erned to a large degree by the manner of analysis which we 
expect toemploy. But much of the confusion and argument 
over current classifications of “methods” is due to a failure 
to distinguish between technique of gathering and technique 
of classification, treatment, and manipulation. Thus we hear 
of the historical method, the “case” method, the “statistical” 
method, the “survey” method, etc. There is nothing mu- 
tually exclusive in these classifications. From one point of 
view, all collections of data are collections of “cases.” The 
“survey” is merely a designation of a type of investigation 
with reference to scope, and may utilize any or all methods 
of collecting the data. A useful classification of methods of 
gathering data, like all classifications, must specify, first, the 
basis or purpose of the classification ; and second, its cate- 
gories must be mutually exclusive. 

The principal general methods of collecting data in the so- 
cial sciences may be classified according to (1) scope, (2) 
sources, and (3) technique of collection. 
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With reference to the scope of an investigation, Professor 
Chapin has classified the methods of inquiry as follows : ° (a) 
Case Work, or the study of individual cases. (b) Sampling, or 
the selection of a part of a field on the assumption that the 
part is representative of the whole. (c) Complete enumera- 
tion, or the inclusion of all the data in a field. Principles 
and methods applying to each of these methods of gathering 
data will be treated in later chapters. 

With reference to sources from which data may be drawn 
there are two commonly recognized, namely : 


(1) Historical Sources 
(a) Documents, papers, papyri, stone tablets, etc. 
(b) Geological strata, fossil remains, etc. 

(2) Field Sources 

o Particular information from living individuals. 

b) Direct observation of behavior in process (including 
verbal behavior when our interest is primarily in 
the verbal behavior, per se, rather than in the sit- 
uations represented by the verbal behavior). 

Historical sources represent the records which past events 
have left behind them, and which are recorded and stored 
by devices external to human beings. The memories of liv- 
ing individuals are, of course, also traces of past events, and 
are historical in the sense that all events which have occurred 
are historical. But since the purpose of the present classifi- 
cation of sources is to establish a basis for a discussion of tech- 
niques for collecting data, written or oral information se- 
cured for a particular purpose from living individuals is here 
classified as a field source. The only additional method 
whereby events or facts become known to us is by direct ob- 
servation while they are in process. This will likewise be 
classified as field work. la 


G. DOCUMENTARY SOURCES 


A careful survey of all documentary sources bearing on a 
contemplated inquiry should always be made before begin- 
ning a field inquiry. This is important in order to avoid 
duplication of work, and to suggest methods of approach — 
pitfalls to avoid, the difficulties involved, etc. Furthermore, 

€F. S. Chapin, Field Work and Social Research, (Century, 1920), p. 47- 


— 
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if we are to compare our findings with those of other inves- 
tigators, a careful study of the procedure in other cases is 
necessary. The problem of locating relevant documentary 
sources in any investigation is largely one of familiarizing 
oneself with the various indices through which such sources 
may be located. The amount of documentary material avail- 
able or relevant will, of course, vary greatly with the nature 
of the undertaking. A study of some contemporary aspect 
of a small, local community, such as a survey of public opin- 
ion, may require very little inquiry into documentary or his- 
torical sources, except possibly to find comparable studies in 
other communities containing. important suggestions on 
methods or ‘results. Other studies, such as, for example, a 
study of the trend of real wages over a period of fifty years 
or a study of the evolution of an institution, would require 
extensive library and documentary research. 

No textbook can provide detailed rules for determining 
what data are relevant to a given problem. The person who 
formulates and studies a problem must judge what data are 
competent and relevant for its solution. Ordinarily, worth- 
while interests and problems for research arise only as a re- 
sult of considerable previous familiarity with a field or with 
a subject. If the student does not already possess such fa- 
miliarity, he will probably begin by seeking expert advice 
on the desirable sources for a beginner. If such_personal ad- 
vice is not available, some readily located brief source, such 
as the Encyclopedia of the Social Sciences, may be consulted. 
The sources suggested in the Encyclopedia might next be 
studied, These references will perhaps in turn provide an- 
notated bibliographies which will enable the student to select 
those sources which are most likely to be useful to him in the 
particular inquiry he has in mind. The amount of printed 
material on general subjects in any large library is usually so 
vast that the mere indiscriminate copying of the titles in the 
card catalog is an extremely arduous and wasteful procedure. 
Doubtless the instructor will recommend good general books 
to be used as a background. The general bibliographies 
appended to books and articles may be appended to exhibit 
the scholarship of the author or to suggest sources to mature 
scholars, They may have only limited usefulness for the be- 
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ginner. However, with many if not most projects it is well 
to get a general background of the subject first. 

The student may then seek recent annotated bibliog- 
raphies* by responsible authorities in the field, in order to 
save himself the dismal and frequently useless labor of ex- 
amining hundreds of volumes, many quite obsolete, unscien- 
tific or barren so far as his inquiry is concerned. To be sure, 
the nature of the inquiry must determine the degree to which 
near and remote bibliographical material must be scruti- 
nized. Yet, in view of the rapid accumulation of printed mat- 
ter on nearly every subject, the old ideal of examining all 
such material is not only practically impossible, but is often 
pure pedantry except for the very mature scholar. For the 
college student discriminating selection of sources on the 
basis of what other competent scholars have found useful is 
essential. 

A bibliography, then, should grow as a study progresses, 
rather than be indiscriminately and voluminously gathered 
at the outset from library catalogs or general reference books. 
The student will usually first consult the ablest authority to 
which he has access either in person or through such a me- 
dium as the Encyclopedia of the Social Sciences. He will do 
well next to locate through the library catalog or through 
periodical indices such as the Readers’ Guide to Periodical 
Literature any existing annotated bibliographies on the sub- 
ject from which he can select the sources most likely to be 
useful to him. Good recent articles may give useful leads to 
further sources. After perusing these sources he will be in a 
better position to determine how much further and in what 
directions inquiry is likely to be fruitful. The “Topical 
Summaries of Current Literature” which appears from time 
to time in The American Journal of Sociology as well as the 
“Abstracts of Periodical Literature,” which is a regular fea- 
ture of this journal, will often prove useful.’ But these bib- 
liographies are compiled mainly for professional sociologists 


TSome such bibliographies are listed in the Appendix. 

8 The abstracting service of The American Journal of Sociology was carried 
on from 1895 through November 1928, when Social Science Abstracts took over 
this function until its discontinuance in 1932. The service was resumed tem- 
porarily by The American Journal of logy, in Volumes 38 and gg (1932- 
$3) and was regularly resumed in Volume 44 (January, 1939). 
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rather than for college undergraduates. For the years 
1929-32 Social Science Abstracts constitutes for students an 
exhaustive review of current material in all of the social 
sciences except psychology, which field is covered by Psycho- 
logical Abstracts. The American Sociological Review also 
carries from time to time selected bibliographies on various 
sociological subjects.? The annual list of “Student Disserta- 
tions in Sociology,” published by The American Journal of 
Sociology and the “Census of Social Research” published 
annually in The American Sociological Review furnish addi- 
tional information on recently completed or incomplete re- 
search under way in this field. Finally there should be men- 
tioned the Index of Research Projects (Volumes I, II, and III, 
1938-39) of the Works Progress Administration made in col- 
laboration with the National Resources Committee and State 
Planning Agencies. These volumes contain a systematic 
classification of several thousand abstracts of recent studies 
conducted under the auspices of these agencies? The Cata- 
logue of Research and Statistical Publications of the Federal 
Works Agency, (WPA, March, 1941) also contains several 
hundred references to recent research under the Work Proj- 
ects Administration. 


H. CRITICAL EVALUATION OF DOCUMENTARY SOURCES 


Having located the relevant documentary sources, their 
critical evaluation is the next problem. For convenience in 
evaluation, documentary sources are usually divided into two 
classes, namely, (1) primary and (2) secondary. 

By primary sources we mean any sources providing data 
gathered at first hand, and compiled by the same authority 
under whom the data were collected from the field. Second- 
ary sources are any other sources. Thus, the registration 
cards in the Registrar’s office of a university would be a pri- 
mary source for a study of the age and sex distribution of the 
student body. The compilation of these records in the uni- 


versity catalog would also be a primary source. A newspaper 

d Specific references to annotated bibliographies will be found in the Ap- 
IX. - 

Poo Students who are not qran familiar with the facilities and methods of 


using large libraries should rea E. Spahr and R. J. Swenson, Methods 
and Status of Scientific Research, (Harper, 1930), Chaps. 7 and 8. 
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report of these figures, however, would be a secondary source. 
Likewise, the complete census reports of the Federal Gov- 
ernment are primary sources, but census figures appearing 
in the World Almanac or the Statistical Abstract of the 
United States are secondary. “When the responsibility for 
gathering the original data and for their promulgation is 
undivided, whether in published form or other available 
forms) the source may be called primary. But when the au- 
thority for the data as promulgated is different from that 
which controlled the collection of the facts at first hand the 
source containing such data may be called secondary.” ™ 

The importance of this distinction lies in the probable 
relative reliability of the two types of sources. Primary 
sources are no more reliable than the persons or agencies 
responsible for them. But if they are really primary, and 
therefore not subject to verification or correction by the per- 
son using the data taken from such sources, the only criti- 
cism possible is in the nature of correction or rejection of 
the data as a whole because of biasing influences or faulty 
technique known to have attended their accumulation. Sec- 
ondary sources are subject to error in proportion to the dis- 
tance of their removal from the primary source. The mere 
processes of transcription, reprinting, etc., afford opportuni- 
ties for unintentional errors. In addition there are the al- 
most unlimited opportunities in secondary sources for mis- 
use of the primary sources through various manipulations — 
partial presentation dictated by bias, rearrangements, neg- 
lect of considerations of comparability, and neglect of quali- 
fying conditions contained in the primary sources. 

Since data taken from documentary sources are subject to 
these numerous errors which may vitiate, largely or entirely, 
their value for scientific purposes, it becomes necessary to 
scrutinize carefully all data drawn from documentary sources. 
If the data are drawn from a secondary source, the first and 
most satisfactory method of checking them is to compare them 
with the ‘primary source when that is available. When the 

MR. E. Chaddock, Principles and Methods of Statistics, p. 992. Secrist 
points out that the distinction between primary and secondary data is largely 
onë of degree, and also that “data which are ry in the hands of one 


party may be primary in the hands ^ Horace Secrist, An Intro- 
duction to Statistical Methods (revised edition, 1925), p. 25. 
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primary source is not available, we have to apply to second- 
ary sources the same principles of criticism that we should 
apply to primary sources. 

The principles of criticism to be applied to all documen- 
tary sources have been well summarized by Chapin as fol- 
lows: ¥ 


A — Documents should first be criticized as to their external or 
objective characteristics. 


1) Authorship should be critically examined. 

2) Sources should be critically classified. 

(3) The investigator should avoid hypercriticism which makes 
the tool the goal and not the means to the end of knowl- 
edge of the truth. 


B — Documents should then be criticized as to their internal or 
subjective characteristics. This is the more important division 
of criticism. It is analytical criticism. 


(1) What did the author mean by this particular statement ? 
What is its real meaning as distinguished from its mere 
literal meaning ? 


(2) Was the statement made in good faith? 

a) Had the author interest in deceiving the reader? 

(b) Was the author under pressure to tell an untruth? 

(c) Was he influenced by sympathy or antipathy to tell an 
untruth ? 

(d) Did vanity influence him? 

(e) Was he influenced by public opinion? 

(f) Is there evidence of literary or dramatic motives to dis- 
tort the truth? 


(3) Was the statement accurate? or more particularly : 
(a) Was the author a poor observer because of mental de- 
fect or abnormality? 
(b) Was the author badly situated in time and place to 
observe ? 
(c) Was he negligent or indifferent ? 
d) Was the fact of such a nature that it could not be 
directly observed ? 
(e) Was the author a mere witness or a trained observer ? 
(4) When it appears that the author was not the original ob- 
server it is necessary to determine the truth and accuracy 
of his sources of information. 
C—Particular facts may. be determined by the procedure of 
comparison which weighs the importance of contradictions and 


12 F, S. Chapin, op. cit, pp. 37 38. 
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agreements and concludes at the most with a determination of 
the degree of probability. 


I. METHODS OF FIELD INVESTIGATION 


After the general problem of an investigation has been de- 
fined; after a working hypothesis has been adopted; and 
after all relevant documentary sources, primary or secondary, 
have been scrutinized, criticized, and evaluated, the methods 
of conducting the field inquiry, if one is required, must be 
decided upon. In this preliminary work, such questions as 
what is the problem, what is the nature of the subject-matter, 
and what form of solution is sought, will have been decided. 
These questions will in turn determine to a large extent the 
minimum scope of the inquiry; that is, the minimum data 
required to justify drawing the type of conclusion sought. 
This is, the problem (to be considered in the next chapter) 
of determining the smallest sample which will be statistically 
valid for the purpose at hand. Other things being equal, 
the larger the sample, the more reliable the conclusions. 
The comprehensiveness of an inquiry will be determined 
largely by the degree of reliability required in the results, 
and by the time, money, equipment, and staff at the disposal 
of those directing the inquiry. These considerations will 
also determine to a large extent the methods to be used in 
obtaining the data. 

There are two principal ways of gathering facts from the 
field: (1) In the direct method the investigator may observe 
and record data by his own direct observation. (2) In the in- 
direct method the investigator secures his data from others, 
perhaps from the oral or written testimony of persons having 
had first-hand contact with the data. 

The first method is illustrated by the investigator who sets 
out to record the number of houses in a certain district ; 
their size and condition with respect to heat and light; and 
distances to schools and churches. He may further record 
from direct observation the characteristics, activities, and be- 
havior of the people living in the district. He may deter- 
mine their race and nationality, their attendance at libraries, 
schools, and social gatherings ; the extent to which they pass 
a certain corner, travel by certain conveyances, and visit cer- 


EE <_< - i e 


METHODS OF SOCIAL RESEARCH 129 


tain stores. Finally, the investigator may by direct observa- 
tion record socially important structural and functional facts 
about individual persons in terms of a cross-section of their 
present status. In this category fall the more intensive phys- 
ical and psychological laboratory tests such as would be in- 
cluded in a medical or psychiatric examination. A number 
of tests have been devised to measure such things as reaction- 
time, coordination of impulses, emotional responses. There 
are also a large number of verbal tests designed to determine 
so-called “attitudes” and “interests.” 13. Whether the latter 
can be classified properly as direct observation or not depends 
on whether the behaviors in question are regarded merely as 
first-hand observations of verbal behavior or whether they 
are taken as reliable indications of what the total behavior 
of the individual would be in a real-life situation. In the lat- 
ter case these tests belong rather in the category of first-hand 
observations. 

The advantages of the method of direct observation by a 
skilled investigator and the corresponding disadvantages of 
the indirect method may be briefly summarized as follows : 
(1) The direct method allows objective observation by a 
skilled observer and is subject to checking by other observers. 
The observations of casual witnesses are necessarily unskilled, 
unreliable and difficult to verify. (2) The direct method 
gives us an account of the actual behavior in a genuine life- 
situation, as contrasted with the behavior which an individ- 
ual thinks he has performed or would perform under actual 
or hypothetical conditions. It eliminates errors arising from 
misunderstandings and misinterpretations of words and the 
bias likely to be present in a verbal report secured from a 
person supposed to have had access to the original facts, but 
who must rely on his memory for many of them. Our knowl- 
edge of the limitations of associative memory and the réle of 
suggestion and fantasy in human behavior must cause us to 
accept all such reports with many reservations. Finally, (3) 
the direct method is the method of physical science, and has 

13 For a summary discussion of these tests, see Chap. VIII below. See also, 
Kimball Young, “The Measurement of Personal and Social Traits,” Publ. 


Amer. Sociol. Soc., 21:93-100, 1927. Also G. W. Allport, Personality, (Holt, 
1937), Chap. 16. For a comp’ ive list of such tests, see W. A. McCall, 


Measurement, (Macmillan, 1939), Chap. 7." 
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all the advantages of accurate observation and measurement. 
Whenever and wherever the nature of the investigation and 
the resources of the investigator permit, the direct method 
should be employed. ; 

The method of direct observation, however, has certain 
limitations which have compelled most social investigators 
up to the present to collect their facts by the second method 
mentioned above, namely, the indirect method of inquiry 
from human beings. This is largely due to the difficulties 
of controlled experimentation in the social sciences, referred 
to in a previous chapter. Through his controlled experi: 
mentation in the laboratory, the physical scientist can ar- 
range to have the behavior in which he is interested occur 
repeatedly on a small scale under conditions which assure 
that his observations will be valid for the phenomena under 
natural conditions. He can, furthermore, make observations 
without having to wait for the fortuitous occurrence of the 
phenomena in nature. He can even shorten the time space 
during which a process takes place, by providing artificial 
environmental conditions, Thus, artificial weathering con- 
ditions can be arranged in the laboratory which will test in 
one year the weather-resistance’ of a paint as fully as ten or 
more years of exposure to natural weather. 

The social scientist, on the other hand, lacking the control 
of the laboratory in the observation of his data, must to a 
large extent await the fortuitous occurrence of the data he is 
seeking, or rely on reports from the memory of other people, 
or on historical records. Most of the facts gathered by the 
Federal Census, as well as by other governmental agencies, 
are secured in this second-hand way. A person’s age, race, 
and nationality and religious affiliation, as well as the state- 
ment of his possessions, income, and business dealings, are 
secured largely from the person interviewed. There is little 
attempt at checking of his statements because some of them 
are not subject to check, and the time, labor, and expense 
involved in checking others would be prohibitive. We take 
the individual’s word for the truth of his statements, al- 
though we know that, leaving out all questions of intentional 
falsification on the part of the Teporter, his memory and 
knowledge of his own affairs will enable him to give only 
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approximately correct answers. At the present time, there- 
fore, we rely on the untrained reporting of the great mass of 
individuals of society for most of our social data. Until we 
develop a very much more comprehensive and efficient sys- 
tem of official recording of significant events as they occur, 
we shall have to continue to rely on this personal testimony. 
As was pointed out in the second chapter, however, experi- 
mental laboratory techniques with the aid of motion pictures 
may become an important source of certain kinds of socio- 
logical data. 

The fact, then, is that at the present time and for an in- 
definite future, we must continue to rely on others, on indi- 
vidual testimony for a great many of the data of sociology. 
This makes it of the greatest importance that we develop 
techniques of securing from people at large reliable second- 
hand information which will give a high measure of predic- 
tability of behavior without the elaborate, time-consuming, 
and costly process of first-hand observation of the behavior 
at the time it occurred. Toward this end some methods are 
very much more effective than others. It is possible through 
a careful checking of items of information against each other 
and against objectively established facts to determine with a 
high degree of reliability the accuracy of an individual oral 
or written report. Since it is unwise to confine our efforts to 
observing directly only the concrete, first-hand, non-verbal 
behavior of people, our problem becomes one of developing 
a technique for eliciting reliable information from others, 
from the persons in first-hand contact with the facts. This 
technique will usually take one of four forms: (1) Life his- 
tory documents — autobiographies, diaries, confessions, let- 
ters, etc, (2) An informal oral interview (individual or 
group). (3) A more formal filling out by the investigator of 
a schedule on the basis of answers to questions. (4) A ques- 
tionnaire filled out by the informant without personal super- 
vision of the investigator. These methods are treated in 
Chapters VI, VII, and VIII. 


J- SUMMARY AND CONCLUSION 


The general viewpoint of this chapter has been (1) that 
all questions of method ate relative to specific ends, and (2) 
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that the broader the purpose, the more composite must be 
the method adopted. The purpose of natural science being 
to reduce the phenomena of behavior to brief laws or formu- 
las which will be generally valid within their field, the 
method of natural science may be regarded as the broadest 
in scope. This method, therefore, calls for the’fullest com- 
bination of all techniques which produce data relevant to 
the complete generalized description of the phenomena un- 
der consideration. Thus it calls for the use of hypotheses. 
In their earliest and most tentative form these may be noth- 
ing but “intuitions,” “hunches,” “guesses,” or other vague 
forms of “subconscious” induction, It may next call for a 
careful inquiry into the historico-genetic aspects of the phe- 
nomena under consideration, especially when these phenom- 
ena are human and social. This part of the method of nat- 
ural science is usually referred to as the historical method. 
Then we are interested in an objective description of the 
present facts — a cross-section of each type of phenomenon as 
it is and behaves at the present time. Weare interested in 
patterns of behavior, the presence or absence of factors, the 
degree or amount of each factor present, etc., in a given time, 
place, or situation. This is frequently called the case method. 
But each or all of these methods do not reduce the data to 
the category of science until conclusions of general validity 
can be drawn from the data so that reliable predictions of 
behavior can be made. Whether this generalization be 
formal and mathematical or informal and subjective, it will 
be quantitative in nature, and is the sine qua non of scientific 
method.* It does not of itself constitute a complete scien- 
tific method, however, as it rests upon the previous steps in 
the process. "In view of this fact, it is obviously futile from 
the standpoint of science to discuss the relative merits of 
different stages in the process. What may be discussed with 
profit is the degree of objectivity and accuracy of different 
methods of performing each step. We turn now to a con- 
sideration of some aspects of this problem. 


For elaboration of this position, see G. A. Lund! 
ciology, Chap. II. See also Chap. I, Sec. I, of the a aT. 
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K. SUGGESTIONS FOR FURTHER STUDY * 


1. A profound treatise on the fundamental nature of all 
inquiry with special reference to scientific method will be 
found in John Dewey, Logic. The Theory of Inquiry, (Holt, 
1938). Part’ I and IV are the least difficult. Chapter IV on 
“Common Sense and Scientific Inquiry,” is especially rele- 
vant in the present connection. á 

2. A digest of examples of different methods that have 
been employed in actual investigations in the social sciences 
may be found in S. A, Rice (Editor), Methods in Social 
Science (University of Chicago Press, 1931). 

g. Suggestions on systematic note-taking and the use of li- 
braries as well as an extensive list of sources of information 
will be found in W. E. Spahr and R. J. Swenson, Methods and 
Status of Scientific Research, (Harper, 1930), Chapters 7 and 
8 


4. For a lucid discussion of “The Art of Note-Taking” 
and for ideas on securing information from “The Written 
Word” and “The Spoken Word,” see Sidney and Beatrice 
Webb, Methods of Social Study, (Longmans, 1932), Chapters 
IV, V, and VI. sie 

5. For a fuller discussion of the historico-genetic method 
mentioned in the text, see Barnes, Becker, and Becker, Con- 
temporary Social Theory, (Appleton-Century, 1940), ch, 15. 
In this connection the student should also read ch. 2 of the 
same volume on “Constructive Typology in the Social 
Sciences,” the main thought of which is entirely compatible 
with the views ofthe present volume. (See G. A. Lundberg, 
“Some Problems of Group Classification and Measurement,” 
Amer. Sociol. Rev., §:351-54, 1940). See also, P. F. Lazars- 
feld, “Some Remarks on the Typological Procedures in So- 
cial Research,” Zeitschrift fiir Socialforschung, 6:119-49, 
1937: n ‘ 
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In actual practice, in science M A DN life we rely on wheh may be 
called the Principle of Fair Samples, that is to say, the belief that, with rea- 
sonable care, it is possible to judge the character of a p, or of a 
whole class of phenomena, b the aid of a sample-or a sel from it. This 
principle p sometimes called the law of Statistical Regularity, ~ Æ. "Wolf. 
t 


A. THE GENERAL USE OF SAMPLING 
Weisaw in the preceding chapter that all observation is 
necessarily selective. It was pointedout that we select for 


consideration only certain phenomena from the innumer- 
able characteristics and relationships that are.actually pres-i 


` ent. Out of the observations $0 selected we make whatever 


generalizations the data justify, But the data’on which we 
base most generalizations in practical life are only a part— 
a sample — of all the relevant data. The same is true of all 
the generalizations of science. ‘These generalizations, both 
in scien practical life, rest upon the assumption that 
what has found characteristic of the sample data actu- 
cally observed will also be characteristic of the whole body of 
data which We. cannot observe. To the extent that this as- 
sumption is valid, a generalization based on a sample will be 
as valid as a generalization” based on all the data. To the 
extent that this assumption is false, our generalization will 
be invalid. © i y 
Except for the economy of time and etoh which the sam- 
pling method affords, the advance of science would” be in- 


Superably: handicapped. “To study ; all the datarin any field 


would be prohibitive. But, through the sampling method, 
however, we are able to make highly rate nizations 
about the whole from a part. If Boperly selected, accord- 
ing to rules to be discussed later, ten men might, for exam- 
ple, give us as true an average’of the height of ten thousand 
men, as an actual study of the ten thousand. Likewise, un- 
der certain conditions, and for certain purposes, a gallon of 
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water might be truly representative of an entire reservoir. 

But only if certain rules and cautions are observed in the 

selection of these samples do we have assurance of their re 

resentative nature, The princi les governing the scledtice 

of su simples are;our sie ern in the present chapter. 
$ “ 


B. GONDITIONS GOVERNING THE VALIDITY OF A SAMPLE 


The degree to which the assumption that part of the data 
is representative of the whole is valid will be determined by 
two considerations, namely, (1) the nature of the data ob- 
served, and (2) the method employed in, selecting them. 

If the data are highly homogeneous, that is, if the,differ- 
ences between t , vario s items composing the whole body 
of data are negl blo eign any item or group of items is 

„representative of the whole. Hence, the problem of sam- 
pling in a field that is known to be;highly homogeneous is 
ye inne >The high. degree of homogeneity existing in 
some classes of data may result from several different causes. 
The process of production may so thoroughly mix the,com- 


ponent elements of a universe that elements be 
found present in every part in practically the propor- 
tion. For example, a teaspoonful of many foot liquids 


may be ajrepresentative sample of a barrelful of the same 
article provided the matter has been thorough mixed, and 
provided thé component élements are suse ble to such 
mixing. Leen , one square inch of a given cloth is usually 
considered a representative sample of all the cloth in a par- 
ticular bolt, though the latter may contain hundreds of yards. 
The process of weaving and dyeing in this case assures us 0 
a high degree of homogeneity of different parts of the cloth. 
We make the same assumption when ‘we accept a few, bars of 
soap, a few cookies, or a few packages of a product as repre-' 
sentative of a whole consignment. The assurance is, that 
the entire quantity i the same ingredients in the same 
proportion. When a cook tastes food while preparing it, 
he is working on this assumption. When the essential na- 
ture of the data insures a degree of homogeneity so high thet 
any differences that may exist between different parts or items 
are negligible for our purpose, thé problem of sampling is 
simple. A part ig closly representative of the whole. 

iea 4 > 
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In the field of social research, however, we usually wish to 
study the aggregations of phenomena lacking the homoge- 
neity we find in such cases. Stephan has summarized the 
problem, and some methods of dealing with it, as follows: 
“Heterogeneity may be defined roughly as the presence of 
subdivisions or groupings of cases such that the cases vary 
less within one or more of the groups than they do in all 
groups taken together, the difference in variability being 
greater than one might attribute to.chance. The processes 
of economic, social, and racial segregation and other ecologi- 
cal processess which characterize American cities produce 
heterogeneity, i.e., a grouping of people and various social 
phenomena into clusters of similar and supplementary units. 
This clustering by race, religion, nationality, economic status, 
occupation, and by many attitudes and preferenées, is such 
that no cluster is quite like the entire community, and conse- 
quently no one cluster can represent it accurately. Informa- 
tion must be secured from each cluster to provide a faithful 
picture of the whole. Usually there are areas of admixture, 
but even these areas are not representative. Often there are 
no clearly defined groupings because the heterogeneity is a 
matter of continuous change along a gradient shading off 
from a predominance of one type of unit to a predominance 
of another, or from one extreme variant type to its opposite. 
This lack of clear cut divisions complicates the selection of 
a sample which will be representative of all the varieties 
present, in the community. It is like drawing balls from a 
large bin into which the contents of a number of different 
urns have been emptied without being stirred. 

“If all social phenomena were distributed at random geo- 
graphically, it would be quite satisfactory to take as a sample 
those cases which are near at hand, However, the presence 
of clustering makes it necessary to take some cases from each 
cluster, no matter how remote or imaccessible. This intro- 
duces serious practical problems of locating the clusters, 
transporting Investigators to each district, supervising a scat- 
tered field force, and securing information from certain 
types of informants. In most cities, and in rural areas also, 
small groups of families whose behayior and characteristics 
differ widely from other groups are located at relatively in- 
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accessible places. Another kind of iñaccessibility, typified 
by the dog in the yard, the locked door, pretended inability 
to speak English, and feigned ignorance, is often associated 
with geographic inaccessibility, although itis also found in 
very accessible places. Still a third kind of inaccessibility is 
presented, for example, by a childless couple who live in an 
apartment where no one knows them, working during the 
day, eating their meals-out, and spending only a few hours 
at home to sleep”? v 'i 

Our chief practical concern with the problems of sampling 
in’the social sciences, therefore, is a consideration of the vari- 
ous methods and safeguards by means of which the highest 
possible degree of validity in the sample may be attained. 
Only some general principles and a few illustrations can be 
given in d textbook. The application of these principles 
to concrete new problems must be left to the ingenuity and 
resourcefulness of the individual researcher who must adapt 
his methods to’suit his material, his purpose, his situation, 
and hi§ resources. Common sense and ordinary logic rather 
than a blind reliance on rules and formulas are essential. 
A wide familiarity with the practical experience of other 
researchers is likely to be most useful. Accordingly, the stu- 
dent is advised to study the cases mentioned at the end of 
this chapter, and others, for insight into the» practical prob- 
lems of sampling. Oth 


C. METHODS OF SAMPLING 


The requirement of a representative sample is that a part 
of the whole should have, in their relative proportions, all 
the signifi¢ant'characteristics of the larger group. A number 
of devices are employed to achieve this result. All of them 
rest on the principle of random selection. The term random 
is frequently considered as meaning careléss, unipremedi- 
tated, casual, haphazard, and “having no definite aim or de- 
liberate purpose.” * This is an unfortunate interpretation 
of the term, and, if adopted, may give unscientific results. 
For we aim in a random selection so to arrange conditions 


2F, F. Stephan, “Pro of Sampling’ Procedure,” Amer, Sociol. Rev. 
1673) 3 36. EA 
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that every item in the whole body of data from which we 
wish to select the sample'shall have the same chance of being 
selected as any other item. This usually calls for careful 
planning and orderly procedure, especially in éxposing the 
whole field to the selection.” It means insuring the equal 
operation of the method of “chance.” For example, select- 
ing names from a directory by opening it at “random” and 
taking the first name upon which the eye falls, might appear 
to be a truly random selection. But if the directory has been 
in common use, it is likely to open more readily at some 
pages than at others. The result might be a bias in favor of 
the more popular persons or businesses. Likewise the eye 
might be more readily attracted by long or peculiar names. 
Both conditions violate the requirements of a sample truly 
“random,” namely, that all possible items shall have an 
equal chance of being selected. ‘ 

e ee general types ôf sampling may be classified as fol- 
ows: 

1. Random selection, under which each item in ‘the uni- 
verse is assured an equal chance of being selected. 

2. “Purposive” (proportional, stratified) selection, under 
which selection is governed by known characteristics of the 
whole universe so as to secure a similar distribution in the 
sample. See Table I as an example.) Thus a small local 
sample of families could be selected so as to correspond com- 
pletely to the different sizes of families in the country as a 
whole, as shown by census figures. 

3. Mixed sampling, which combines (1) and (2). 

1. Random selection. — (a) Random selection is “often suf- 
ficiently secured by the process of spreading,out a consign- 
ment of goods, etc, and marking one here and another there, 
avoiding therfirst, the last, and the most obvious, and testing 
the objects marked.”* If the objects drawn in this way are 
` apparently similar, ¢.g., cards or capsules containing a num- 

ber, the methéd is satisfactory. If, however, the objects are 
a consignment of goods or other objects toward which sub- 
tle and perhaps entirely unconscious preferences or preju- 


4G. U. Yuale and H. G. Kendall, An 7 i isti 
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dices may exist in the person making the selection, the pro- 
cedure may result in a biased or non-representative sample. 
(b) To avoid the possibility of such bias, it is preferable 
to secure equal chances of selection by “numbering the whole 
groups consecutively, writing down the umbers on tickets 
and shuffling them, and finally drawing at random some of 
the tickets and examining the objects with corresponding 
numbers. “To avoid the writing and drawing, digits are 
sometimes selected at random from mathematical tables and 
used as if they were numbers drawn at random.” ° Drawing 
numbers in capsules ‘that have been thoroughly mixed in a 
rotating urn is essentially the method employed in all lot- 
teries, and is employed in selecting men for the draft army. 
(c) Another method is to select, for example, every tenth 
case, one every two inches, etc., from a series, or from an 
alphabetical list or file. Any such arbitrary selection will 
usually give a representative sample provided the basis of 
the arrangement is unbiased with reference to the char- 
acteristic for which the sampling is being conducted. If 
such sampling is undertaken from a file, care must be exer- 
cised lest some records be temporarily missing. Active cases 
may differ significantly from inactive cases. Again, if some 
cases consist of more than one folder or card, they will have 
more than an equal chance of appearing im the sample.” 
Bias may result also from certain characteristics of the mate- 
rial itself, For example, sometimes all the names beginning 
with a certain letter are taken as a sample. If we are sam- 
pling for nationality, this method would be biased, as a dis- 
proportionately large proportion of names in some nation- 
è 9$ 
6 Ibid, For a fuller description of this progess, seč the account of the use of 
fake Send numbers in G. U. Yale and M. G. Kendall, op. cit, pp. 841-44. 
‘or further details on ible errors of this kind ways of avoiding 
them, see F. F. Stephan, “Practical Problems of Sampling ure” Amer. 
Sociol. Rev., 1:570, 1936. See also the same author's account of the problems 
of sampling involved in the selection, in the Census of 1940, ofa sample of 
two persons out of every forty (five percent) by deiat certain lines of 
the census schedule as sample fines. For example : “If every sheet started with 
a new family, all the individuals on line 1 would be heads, When this is 
the case a sample that includes lines 1, 21, 41, and 61 Will greatly overstate 
the number of heads, and a sample that consistently excludes line 1 will un- 
derstate the number of heads. Similarly, one that consistently includes line 
2 will overstate the number of married women.” “The Sampling Procedure 


of the 1940 Population Census,” by F. F. Stephan, W. E. Deming, and M. H. 
Hansen, Jour. Amer. Statis. Asso., 35:623, 1940. 
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alities begin with a particular letter. Thus Stephan found 
that in Pittsburgh names beginning with F and T formed a 
seven percent sample quite representative of the entire popu- 
lation in economic status, race, and nativity. He warns that 
these results would not hold for a community with a differ- 
ent nationality composition.$ 

A slight variation of the above method is the selection of 
the items representing the maximum, the minimum, the me- 
dian, the quartiles, and two deciles of the sample represented 
as an array. In this way a good estimate could quickly be 
obtained, let us say, of the grade distribution of a large class. 
This might form a reliable basis for comparing it with an- 
other class.° 

2. “Purposive” selection The above methods assume that 
the composition of the total group is not known, and that 
therefore a strictly random selection, or a selection by regu- 
lar intervals, will come scs to giving us a representative 
sample. Yet frequently something is already known of the 
universe from which we wish to select a sample. If this 
knowledge is of a general kind, “investigators may prefer to 
choose what they believe to be typical groups, and examine 
them in detail. Thus investigations as to the wages, etc., of 
agricultural laborers have been conducted by selecting some 
forty districts throughout the country, so as to include types 
of all kinds of agriculture, and of all economic situations. 
This method results in an accurate and intelligible picture, 
but there 1s no easy means of calculating any average, or of 
knowing the distribution by number of persons earning vari- 
ous rates of wages. For filling in details where the general 
results are known the method is to be recommended.” 1° A 
sample of this kind is likely to approach the average of the 
universe, whereas the random sample may vary widely from 
the average. On the other hand, the random sample be- 
comes more and more representative (i.e., it gives a truer 

$F. F. Stephan, “ i i ” i 
Rev., 1: 571, r 936. Ane ae porphin so itnn edt is te 


research in this field which would permit the use of addressograph lists in 


setting up samples for different purposes. Stephan al i- 
po of combining lists with known biases in ooh Hear we seek the 


9A. L. Bowley, op. cit., p. 62. 
10 Ibid., p. 63. Spb te 
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picture of variations in the whole universe) as it becomes 
larger, whereas this need not be true of the “purposive” 
sample, because it has been deliberately chosen to approxi- 
mate the average. This more or less a priori method of se- 
lecting what is typical, is, of course, subject to the bias of the 
investigator. But when the typicality of the groups selected 
can be demonstrated, this is an excellent method. 

In some cases the more or less exact composition of the 
total group with respect to some significant characteristics is 
known before we select our sample. In such cases, we can 
increase the chances of selecting a representative sample by” 
selecting sub-samples proportionate in size to the significant 
characteristics in the total. By thus selecting a sample that. 
is mathematically absolutely representative with regard to 
some significant characteristics we increase the possibility 
that it will also be characteristic in the other respects in 
which we are interested. This sampling technique, some- 
times called proportional or stratified selection, is especially 
important in the social sciences and can best be described by 
an illustration. 

Suppose that in a certain university we know that the total 
registration of 5375 students in three colleges is distributed 
as shown in Table I. Assume that we wish to select the 
smallest possible representative sample of this group of stu- 
dents. The smallest single group in any of the subdivisions 
for which we have information is the women in the senior 
class of the school of business administration with a total of 
50. This group must have at least one representative in the 
sample. Practical purposes will, in fact, as we shall see later, 
usually demand more than a single representative of this 
smallest sub-group. If we assign one representative, to this 
group, however, we should have to assign one and one-half 
representatives to the junior women of this school (who total 
75), to be strictly proportional in our selection. Since this 
cannot be done on account of the nature of the units (human 
beings) we must increase the representation for each group 
sufficiently to give us whole numbers. That is, we must as- 
sign to the smallest group a number of representatives suffi- 
ciently large to give each other group representation accord- 
ing to its size in terms of whole units. To accomplish this, 
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TABLE I , 
Composition and Proportioned Sample of a Student Body 


Total Men 


No.of | Min. No.of | Min. No. of 
Students | Sample | Students | Sample | Students | Sample 


2550 102 


Liberal Arts 800 
Freshmen 600 275 
Sophomores 200 
Juniors 175 

150 


mG 
ON Oms 


m 


DUO AY 


375 
150 
100 
75 
50 


1375 
400 
350 
325 
300 


We must find the greatest common divisor of all the numbers 
in our table. In the present case, we find that this common 
divisor is 25. We accordingly give each group representa- 
tion in our sample in a ratio of 1 to 25. That is, we take a 
four percent sample. This gives each group the representa- 
tion indicated in the table. The total sample of 215 would 
then be strictly representative of the student body in these 
three colleges with regard to sex, college, and class. This 
method of selection could, of course, be indefinitely ex- 
tende take cognizance of any known groupings in the 
student body. 

The final selection of the members to comprise the sample 
would, of course, be made according to one or more of the 
methods of random sampling described above. The concrete 
Sanne, would be somewhat as follows : 

a) Determination of the significant kno i ith 
the number of persons in phi ae 

(b) Determination of the minimum size of the sample. 
The smallest number which will include all the elements 
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represented in our known groupings would, of course, be 
one representative of our smallest group, and all others in 
proportion. But if this number makes a proportionate rep- 
resentation impossible, we have to select the highest ratio 
which will achieve this result. This ratio can be determined 
by finding the greatest common divisor of all the numbers 
representing our lowest sub-groups. In our illustration this 
number was 25, and each group would therefore be repre: 
sented in the sample in the ratio of 1 to 25. This procedure 
is, of course, a purely mechanical one for determining, the 
relative size of the samples from different groups. The abso# 
lute minimum size of the sample would have to be deter- 
mined by practical considerations to be discussed below. 

(c) When the number to be selected from each.group has 
thus been determined, this number of items Ser e drawn 
according to any of the methods of random sampling de- 
scribed in am'earlier section. Thus in our illustration, names 
of all the female freshmen in the college of education (400) 
would be placed on cards and shuffled. Sixteen names would 
then be drawn at random or at regular intervals. The same 
process would be followed for each group. The combina- 
tion of all of these drawings or selections of sub-samples 
would then be a representative sample of the whole body of 
data considered. This sample would be strictly representa- 
tive with respect to the elements taken into consideration in 
the selection—in our illustration: sex, college, and class. 

By inference we might also consider it representative in 
many other ways. Thus we might expect such a sample to 
be typical for nationality, income, and religious affiliation of 
the student body. But in the absence of previous knowledge 
of the range and distribution of these characteristics in the 
student body, we could test the proposition only by one or 
more of the methods to be tliscussed in the next section, In 
the illustration used, the principle of relative size of sub- 
samples has been observed. The absolute size of the sample 
which we might actually select for a given purpose, however, 
might necessarily be larger than the minimum at which we 
arrived above. As a principle of absolute size, Chapin has 
stated that each sub-class “should be large enough to repre- 
sent the character and range of variation among all the items 
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of this particular class.” 1! Thus in our illustration we in- 
cluded only two items from our smallest group (senior women 
in the college of business administration). If we expect to 
use our sample as representative of this group in nationality, 
income, and religious affiliation, a sufficiently large number 
should be included at least to cover the range of nationali- 
ties, religions, and incomes known to exist in the student 
body. This would probably necessitate including eight or 
ten persons from this group, and a correspondingly large 
sample from the other groups in proportion to their respec- 
"tive sizes. “Except when a census is by definition merely an 
inventory, small numbers in the cells of a table, even when 
derived from a complete tabulation, have no more signifi- 
cance than they would if they were derived from a sam- 
ple.”2 So the absolute size of the sample must be deter- 
mined by a variety of considerations, chiefly with reference 
to its purpose and to our knowledge of the composition of 
our whole group. “When the sample is to provide estimates 
for small or highly segregated items, such as the higher in- 
come classes, for example, it is often desirable to augment 
the sampling for this group by selecting several times as 
many cases as would be selected otherwise. Theoretically, 
the sample should include an equal number of cases for each 
important subgroup which is to be analyzed separately un- 
less there is considerable difference in variability among sub- 
groups. If the variability of a subgroup is greater than that 
of other subgroups, it requires a greater number of cases in 
the total sample so that estimates for all subgroups will be 
equally reliable. This is not always feasible in practice, but 
the sample which includes all subgroups proportionally is 
not the best sample (apart from practical considerations) un- 
less the subgroups are of the same order of variability and 
are either of comparable size or not intended to be used sepa- 
rately in the later stages of the study. When subgroups are 
selected with different sampling ratios, appropriate adjust- 
ments or weightings are applied to each in making up esti- 
mates for the entire aggregate.” 13 


UF. S. Chapin, Field Work and Social Research, (Cent i 
12 F, F, Stephan, W. E. Deming and M. H. adis op. vite Pp Eee. 


18 “Practi i 
ji F. F. Stephan, “Practical Problems of Sampling Procedure, op. cit, p- 
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3. Mixed sampling—When the items of the sample are 
selected at random from individual groups which already 
have been chosen “purposively,” proportionally, and/or 
from stratifications of the universe, this has been called the 
mixed type of sampling because it employs both the random 
and the purposive methods. Thus, if the representatives of 
each of the smallest subgroups in the above illustration of a 
student sample are selected by strictly random methods, that 
illustration would serve also as an illustration of the mixed 
method. Another illustration can be drawn from the cur- 
rent polls of public opinion. The more reliable of these 
polls have abandoned the practice of sending post card in- 
quiries to large numbers of people selected from telephone 
directories or other sources, which was mistakenly hoped 
would give a “random” sample. Instead, the population of 
the country is first “‘stratified” according to geographic loca- 
tion, sex, rural, urban, (further divided according to size of 
city), age, income, and party affiliation. The sample is then 
selected as nearly as possible at random within each of these 
groups, and in their proper proportion, so as to correspond 
to the whole voting population in all of the above respects. 
That is, if one-tenth of all the eligible voters in the country 
are Republican males, aged go to 45, living in New England 
cities of 25,000 to 100,000 population, with incomes between 
$3000 and $5000, one-tenth of the people in the sample 
chosen should have these characteristics. By this process, 
results accurate within g percent* on most subjects can be 
secured from a sample of 5000 (about .o1 percent of the vot- 
ing population), whereas a sample of over two million by the 
post card method as employed by the Literary Digest in 1936 
erred by more than 10 percent in its estimate of the popular 
vote. t 


D. THE SIZE AND RELIABILITY OF THE SAMPLE 


We have been primarily concerned in the present chapter 
with the representativeness of a sample rather than with its 
adequacy. Adequacy deals with the variation we may ex- 
pect owing to the chance conditions of sampling. As such it 


14Elmo Roper, “Sampling Public Opinion,” Jour. Amer. Statis. Asso., 
85:331, 1940. 
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is a problem of mathematics, whereas representativeness is 
more of a problem in applied logic. The problem of ade- 
quacy is very directly connected with the size of the sample 
and the theory of probability. For a full discussion of these 
subjects, the reader is referred to any standard text on statis- 
tics or mathematics.!® We treat here only its more elemen- 
tary aspects. 

Any random sample, no matter how carefully selected, 
tends to differ to some degree in its composition from the 
composition of the whole body of data from which it is se- 
lected. This is due to the operation of chance in selection. 
The size of this chance variation depends on several things. 
First, the greater the homogeneity of the data, the less will 
be the tendency of any sample, regardless of its size, to vary 
from the whole. This is necessarily so, since homogeneity 
means that the range of variability is small. Obviously a 
sample selected from a given age-group of males would show 
less variability than successive samples of like character and 
size selected from the whole population. That is, the prob- 
able error of the mean in the latter case would be greater. 
Second, the degree to which the average of a sample tends to 
vary from the average of the whole body of data depends, 
other things being equal, on the size of the sample. If we 
wish to determine the mean height of an indefinitely large 
sample of men, our chance of closely approximating the true 
mean is considerably greater if we select a sub-sample of 45 
rather than a sub-sample of 5. In fact, it can be mathemati- 
cally demonstrated that in this illustration the larger sample is 
likely to be three times as accurate as the smaller? Assume, 
for example, that the difference between the mean of the five 
cases and the mean of the original sample was g inches. To 
reduce this difference to 1 inch, it would be necessary to 
take not 3 X 5 cases, but 9 X 5 or 45 cases. (The reliability 
of the sample increases directly as the square root of the 
size of the sample.) That is, the probable error (the devia- 

aie Ge eee Op. cit, pp. 213, 384. 
By the formula for the Probable Error of the mean ( P oo :) See R. 


745 7 
E. Chaddock, o; va 


pth ey p- cit., ch. 11, especially PP- 237-46, or any standard text on 
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tion from the true mean of the original sample) of the mean 
calculated from the 5 men would be likely to be about three 
times as large as the probable error of the mean calculated 
from the larger sample. 

In somewhat the same way the standard error of the pro- 
portion shown by a random sample (from an indefinitely 
large universe) may be computed by the formula ** 


Standard Error = A / me 


in which 
n =the number of items in a sample from a uni- 
verse. 
p' = the proportion in a sample having the given 
attribute. 


By this formula it can be demonstrated that unless the 
proportion of items in the universe having some specified 
attribute is very small, the chances are two out of three that 
the actual proportion in the whole group, the universe, will 
not differ from p’ by more than the amount given by the 
formula. For example, if in a sample of sixteen students, 
eight, or 50 percent, are found to be freshmen, then the 
chances are two out of three, assuming random selection of 
the sample, that the proportion of freshmen in the entire 
student body does not vary from 50 percent by more than 
12.5 percent. The 12.5 is obtained by inserting the value of 
p’ (14) and the value of n (16) in the above formula, thus : 


S.E. = LOD = A/S org = 58 125% 


Hence the chances are two out of three that the true propor- 
tion of freshmen lies between 37.5 and 62.5 percent of the 
total. 

The theory of probability, the derivation of probability 
formulas, and their application to concrete cases are beyond 
the scope of the present discussion, and must be left to 
statistical and mathematical treatises on the subject. We are 


18 See Yule and Kendall, op. cit., Chap. 19. The “Probable Error” is given by 
the same formula as in the text above, if the term 2/3 is inserted before the 
radical. The interpretation would then be that the chances are even ("50-50") 
that the proportion of freshmen in the entire student body does not vary from 
xo percent by more than 8.5 percent. 
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interested in the present connection in pointing out merely 
the nature of the relationship between the size of the sample 
and the degree of stability in the measures obtained from the 
whole universe or body of data. Neither the actual size of 
the sample nor the size of its probable error necessarily guar- 
antees its representativeness. As Chaddock has shown, the 
method of drawing the sample may result in a fundamental 
bias, which a mere increase in the size of the sample can- 
not, of course, eliminate” This bias can be eliminated 
only by considerations of logic in the technique of defining 
the universe and selecting the sample. In the same way, a 
small probable error in no way necessarily guarantees the 
representativeness of the sample. It indicates only the ade- 
quacy of the sample to take care of chance variations, which 
would be present whether the sample is representative or 
not. If, however, the measures computed from successive 
samples chosen in a similar manner show a considerable fluc- 
tuation—if we find repeatedly that the averages or other 
measures fluctuate by more than three times the Standard 
Error, as computed above, it is a warning that other than 
chance factors are causing variations. The entire procedure 
of sampling should then be reviewed. It is in this indirect 
way, in furnishing an index to the probable reliability of the 
sample and the degree of significance to be attached to the 
results, that the measures of probability are of great impor- 
tance in sampling. 


k We may conclude, then, with reference to the absolute 
size of the sample : 


1. A large sample is more likely than a small sample to include 
all the significant characteristics of the universe in question if 
the method of sampling has been random in both cases. 

2. The stability of measures computed from a sample increases 
as the square root of the number of items composing it. 


_ §- The lower limit of the sample is the smallest number which 
includes all the essential elements.2° 


4. The upper limit is the largest number that ientl 
or conveniently handled with the available cee 7 
E A sample of large size or small probable error, or both, does 
not necessarily establish the representativeness of the sample. 
WR. E. Chaddock, op. cit., p. 245. 


Cf. F. S. i . cil 
a maa S. Chapin, op. cit., p. 122. 
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E. PRACTICAL TESTS OF THE ADEQUACY OF A SAMPLE 


The problem of the size and representativeness of the sam- 
ple is obviously of great practical concern to any investigator. 
On the one hand, we wish to generalize from our analysis of 
the sample, and wish to be as sure as possible that what we 
find true of the sample would hold true of the whole group 
from which it is chosen. On the other hand, we wish to ex- 
pend the least time, money, and effort consistent with the 
attainment of our result. As we have seen above, when the 
composition of our whole group is known the minimum size 
of the sample may be mathematically determined. But where 
this composition is not known, which is frequently the case 
in any investigation, and always true in the case of our small- 
est sub-sample, we must satisfy ourselves of the adequacy and 
representativeness of our sample by empirical methods. 

Perhaps the simplest and most practical empirical method 
of testing the adequacy of a sample is to take several other 
samples of about the same size, by the same or a variety of 
methods, and compare them with the sample selected for in- 
vestigation. If the results are not sufficiently alike or stable 
to accord with the standard of accuracy previously deter- 
mined, it indicates that the size of the sample should be in- 
creased until the successive samples show substantial similar- 
ity, that is, until most of the variations in each sample are 
small and occur as frequently in one direction as in the 
other? The degree of similarity of successive samples which 
we insist upon will be determined, of course, by the standard 
of accuracy we have adopted for our investigation. 


F. GENERAL CAUTIONS IN SAMPLING 


Bowley has well said that “no formal rules can replace 
judgment and experience in the selection and interpretation 
of samples.”** All that rules can do is to cause the investi- 
gator to be wary of the more common and obvious errors, 
and to help him avoid extravagant generalizations from his 
findings when the validity of the sample is doubtful. Broad 


generalizations from inadequate and non-representative data 


22 Ibid., p. 125. 
3 A. L. Rowley, op. cit., p. 62. 
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drawn from personal experience are perhaps one of the com- 
monest sources of error in current social thinking. The pub- 
lic press is full of reports of investigations proving diametri- 
cally opposite conclusions, and frequently the explanation is 
to be found in the methods of sampling employed. Fre- 
quently, of course, the mis-selection of the sample is delib- 
erate in order to prove a case for advertising or other propa- 
gandist purposes. But often ignorance of the principles of 
valid sampling is the cause. 

As Stephan has said: “In some situations it may be possi- 
ble to choose a sample which is superior to a random sample. 
The advantages of deliberately selecting items for a sample, 
as compared with selecting at random within relevant cate- 
gories or strata, probably cannot be decided in general terms. 
Deliberate selection is preferable if the investigator has rea- 
sonably complete knowledge about each item. In practical 
situations, however, he can seldom be sure that he will not 
unwittingly introduce serious biases in choosing the items. 
Expediency may dictate the selection of certain cases in the 
sample. ‘There is then nothing for the investigator to do 
except to choose as wisely as he can and to keep a wary eye 
open for clues as to the possible biases that result. The prob- 
lem focuses on the nature of the social or economic system in 
which the sampling is conducted, the kind of relations or 
regressions between relevant factors, and the extent to which 
the investigator knows about them. With complete knowl- 
edge he can pick a perfectly representative sample, with com- 
plete ignorance he cannot avoid picking a random one, but 
with a little knowledge he may be badly misled.” Many 
defects in sampling are due to the common human tendency 
to select the data which are most convenient and available.” 
For example, Bowley has shown the possible error in taking 


as F. Stephan, “Practical Problems of Sampling Procedure,” op. cit., pp- 


15 For an excellent example of defective i p 
sions drawn from the sam; S see "The hry, id As Epa pob. 
lished by the National Civic Federation (Industrial Welfare Department), 
See also I. M. Rubinow, “Where Will You Be At 652° (New Republic, 
sp stot, 1928) for an excellent criticism of this study, sample consists 

about one-fourth of one percent of the aged lation of the country, all 
pa coey - oe cities = ar states on t tlantic seaboard. It neglects 

all a alrea: i i i ion- 
ate number of penons of wiidiedais panee eoe =s mi s pD 


THE SAMPLE 151 


a sample of school children in attendance on a given day if 
our purpose is to determine the physical condition of school 
children2® Chaddock has likewise illustrated the error pos- 
sible in a sample of factory workers’ family budgets when the 
accounting required is such that only the members of a cer- 
tain select group would probably be able and willing to fur- 
nish an account of their budgets.” 

We have already mentioned the biases that may result 
from geographic inaccessibility, absence of the people to be 
interviewed, their illness, their inability to speak English, etc. 
Then there is always the possible bias in samples because 
some informants refuse to cooperate. If these people do not 
differ from the rest in any other way than in their refusal, 
there is no bias. But refusal may be associated with charac- 
teristics highly important to the investigation: age, educa- 
tion, income. If so, omitting such cases may cause a serious 
bias. Samples selected by telephone interviews have been 
shown to be biased against the lower income groups.” The 
selective bias in questionnaires sent out broadcast, and re- 
turned in only small proportions is subject to this error. 
Roper reports that in public opinion polls, at least, “older 
people answer mail ballots more readily than young people ; 
that the prosperous and well-to-do return a great many more 
ballots than the poor people, and that the people with vio- 
lent emotions, particularly with violent emotions against 
whatever is going on, return mail ballots in greater percent- 
age than do the people of more moderate sentiments. There- 
fore, even though the Literary Digest was able in some way 
to get its ballots to a true cross-section of the public, it was 
sure that a true cross-section of the public was not returning 
the ballots. Their returns were heavily weighted with the 
old who were less strongly pro-Roosevelt than the young, 
heavily weighted with the prosperous who were more strongly 
pro-Landon than the poor, and also weighted with a large 
return from violent critics of the administration who, feeling 

26 Ibid. P- 57- 4 

pat Chaddock, y ĉit., p. 386. . 

3P, Lazarsfeld, Radio and the Printed Page, (Duell, Sloan and Pearce, 


1940), pp- 13, 32 fi. See also, by the same author, “Interchangeability- of 
Indices in the Measurement of Economic Influences,” Jour. Applied Psychol., 
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themselves unrepresented, wantedto cast a mail ballot or 
write a letter to the editor, or to do something to show their 
disapproval.” *° 

It is impossible to give a list of cautions against all the pos- 
sible errors of sampling that may occur even in the more 
conventional type of survey or investigation. Knowledge of 
rules is no substitute for intelligent alertness. Each investi- 
gation may be in some respects unique. For example, inter- 
views with the inhabitants of every tenth house along the 
streets of a city may give a representative sample regarding 
certain characteristics. But, if the streets are divided into 
blocks by cross-streets at every tenth house, so that an undue 
proportion of corner houses are included in the sample, the 
results may be far from representative. Corner houses may 
belong to families of higher economic status, or such houses 
may be shops with families living upstairs. To avoid such 
difficulties all house numbers for the areas to be sampled 
might be placed on cards, shuffled, and drawn after the man- 
ner of lottery sampling. Or, sample blocks or segments of 
blocks may be selected on the basis of other known informa- 
tion about the area.” By defining a block segment as one 
including every house from a specified house number to (but 
not including) another specified number, houses not listed 
in directories would have an equal chance to be included. 

In our discussion of sampling we have assumed for the 
most part that the practical questions to be answered would 
specify, at least implicitly, what universe is to be sampled. 
It is desirable to be explicit on this point because a conclu- 
sion may be valid for the sample in some respects but not in 
any or all other respects. A given sample may represent sev- 
eral different universes, although it represents one or some 
of them much better than others. Stephan illustrates the 
matter admirably as follows: 

“Questionnaire returns from a group of office workers who 
are employed by a public utility corporation may be regarded 
as representing (1) those office workers in the corporation 
who are disposed to answer questionnaires, (2) all office work- 


2 Elmo Roper, “Sampling Pub inion,” À 
a eat 1940. pling Public Opinion,” Jour. Amer. Statis. Asso. 
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ers employed by utilities in the same city at the time the 
sample was taken, (3) all"office workers employed by this cor- 
poration during a period, say five years, preceding (or fol- 
lowing) the time the sample was drawn, (4) all office workers 
in this city, (5) all ‘white collar workers,’ (6) all utility em- 
ployees, (7) all residents of the city or area covered by the 
sample, (8) all residents of similar cities, or any one of many 
other universes which may be constructed or defined for the 
purposes of drawing general conclusions from the sample. 
The entire socio-economic aggregate may be sliced in a mul- 
titude of directions, and the sampling relationship may be 
assumed to exist between a sample and any appropriate slice. 
Some of these possible universes are quite similar, and inves- 
tigators frequently shift their discussion or conclusions from 
one universe to another without justifying the change of ref- 
erence or satisfying themselves that the sample is a relatively 
unbiased representative of the second universe as well as the 
first. This is a weakness common to non-statistical studies 
quite as much as to statistical investigations. In the case of 
the latter, measures are available with which each universe 
may be specified in precise terms.” 31 


G. SUMMARY 


è 
In conclusion we may summarize the steps of the sampling 
procedure as follows : * 


1. Choice of the universe or population to be sampled. All 
voters or all adults? Native white families only? etc. 

2. Decision on a suitable means of access to the population to 
be sampled. House-to-house canvass? By mail? etc. 

3. Choice of a unit of sampling. Individuals or households ? 
Block, census tract, ward, or township? etc. y 

4. Choice of a method of selecting the sample. Selection by 
persons very familiar with the popu ation to be sampled? Divi- 
sion of the population into strata known to exist, (€.g., Age, SEX, 
economic status) relevant to the inquiry? Various random meth- 
ods discussed above ? 

5. Actual selection of the sample. “Drawing” of numbers, 
selecting houses to be visited, checking sample secured against 
reliable information already known to detect inadequacies and 

81 F, F. Stephan, op. cit 574775 

32 Adapted from F. F. Stephan, “Representative Sampling in Large Scale 
Surveys,” Jour. Amer. Statis. Asso., 34:3475% 1939- 
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biases, adjustments to be made in final results as partial correc- 
tions of these biases, etc. ~ ` ‘ 

6. Securing data for the units in the sample. Observe, inter- 
view, or write to individuals, or transcribe data from records se 
lected by sample. 

7. Tabulation and analysis of sample data. : 

8. Application of the sample data to the problem under con- 
sideration. Considerations of reliability, size of errors due to 
“chance” variation, general adequacy and validity of the data. 


We have limited discussion.in this chapter only to the 
more elementary.and practical problems of certain types of 
sampling, especially sampling of populations. But we may 
also be interested in sampling.time, space, or elements con- 
stituting cultural concepts. For example, when our data 
comprise a period of time or even a date, we are really deal- 
ing with a sample of all time. It is not a random sample and 
hence the logic of ordinary standard errors does not apply. 
It is well to keep in mind, however, in our conclusions, that 
our inferences regarding trends are based on a sample of 
time, and any projection of these trends into the future will 
4 don the adequacy and the validity of our time sample. 

ny region studied is likewise only a part of a larger region. 
Finally there is the sampling involved in the construction of 
the fests and scales (discussed in a later chapter). Thus 
Chapin, in the construction of his scale%* for measuring 
socio-economic Status, selects four general components (cul- 
i equipment, effective income, material possessions, and 

al participation) of status from the whole complex of cul- 
ture, and then selects a small number of specific items as 
samples of these four components. S, C. Dodd," likewise, in 
his construction of a scale for measuring rural hygiene in 
Syria first formulated some 2000 questions regarding hygiene, 
and gradually reduced their number by selecting those which 
correlated highly with the full scale. 

These direct techniques of situation sampling of culture 
and behavior are just as important as the more familiar sam- 
pling of people. To be sure, our sampling of people in the 
social sciences is usually on the basis of thessumption that 


SF. S. Chapin, Contemporary American Instituti. 
"Se sc meer IX of the present text. ne oan VARIER, 1996), Chap. 
wae roe oof Controlled Experiment on Rural Hygiene in Syria (Oxford 
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they are the carriers of the culture or behavior studied. -But 
in some types of investigation and, experimentation we are 
interested not only in the representative nature of the popu- 
Tation, as such. We are also interested in the behavior sam- 
ples secured from that population. For example, we may 
observe during definite intervals of time, the occurrence or 
non-occurrence of certain specified behavior in a group of 
children. The question of how representative and adequate 
are the time samples we have taken as compared with what 
constant observation of the same group would reveal, is here 
obviously a problem in time sampling.*® Likewise, in a study 
of traffic problems we are interested not only in the repre- 
sentativeness and adequacy of the city or the part of the city 
where the observations are made, but also in the validity of 
the time samples during which observations were made. 

Practical necessities frequently compel us to proceed in 
investigations when we know that our sampling methods are 
far from ideal and that our sample is therefore probably de- 
fective. Provided we do not make rash generalizations from 
such studies there is, of course, no objection to them. On 
the contrary, they may be of great suggestive value and fur- 
nish hypotheses for further investigation. A compilation of 
typical examples of sampling employed in notable research 
projects would have great suggestive value, as would a com- 
pilation of examples of defective sampling. Each new situa- 
tion, however, may require a new type of sampling. Resi 
sourcefulness, imagination, and critical insight are always 
necessary in selecting a representative sample. 


H. CONCLUSION 


We have pointed out in this chapter some of the ways in 
which it is possible to secure reliable knowledge of a vast 
universe by acquainting oneself with a sample of it. As 
we have seen, the validity of our assumptions about the 
whole from a study of a part, depends upon the representa-* 

35 For further discussi f such techniques, see W. C. Olson and Elizabeth 
Cunningham, “Time pling Techniques,” Child Development, 5:41-58, 
1934. Florence Goodenough, “Measuring Behavior Traits by Means of Re- 
peated Short Samples,” Jour. Juvenile Research, 12:290-35, 1928. Ruth E. 


Arrington, “An Important Implication of Time Sampling in Observational 
Studies of Behavior,” Amer. Four. Sociol., 43:284-95, 1937- 
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tiveness of that part. The technique of selecting a sample 
so that there is high probability that it will be representative 
has been our chief concern. 

The dependency of men in all walks of daily life upon the 
validity of their generalizations from samples is obvious. 
Were it not for the validity of our assumption that what we 
have found to hold true in one situation will also be true in 
other situations which we consider similar, our reactions 
would be chaotic, and would consist of blind trial-and-error 
gropings. We learn in actual life to rely upon the theory of 
sampling long before we formally become aware of its exist- 
ence. Unfortunately, the simple, primary-group, rule-of- 
thumb sampling methods that everybody uses from child- 
hood frequently mislead us when we attempt to project 
them into more remote fields of secondary-group adjustment. 
The reason is, of course, that in primary group relationships 
we know a good deal about the universe we are sampling. 
The opinions of two or three people in our own community 
are promptly evaluated from the standpoint of the repre- 
sentativeness of these persons—the bias they represent, and 
the degree to which they probably represent community sen- 
timent. The opinion of the preacher and the banker as well 
as that of the village “bum” is promptly evaluated from the 
standpoint of bias and representativeness. In a strange city 
or in a foreign country, however, we are prone to accept the 

_ opinion of any taxi-driver as reliable information about the 
social, economic, and political conditions in the whole coun- 
try. If our informant is a prominent person, whether in 
poetry, chemistry, or journalism, we forget even more com- 
pletely the elementary rules of valid sampling. 

This naive projection of local primary group sampling 
methods into secondary group fields is exemplified also by 
the politicians, journalists, and tourists who are unable and 
unaccustomed to interpret sources of comprehensive infor- 
mation about a foreign country, and hence decide to spend 
two or three weeks there to “get the real facts.” The inci- 
dents which they experience and report, no matter how 
unique and no matter how small a minority is affected by 
them, become the stereotypes on the basis of which whole 
nations are condemned as “barbarian,” “godless,” or what 
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not, or, conversely, are glorified as “kindly,” “generous,” 
“honest,” and “democratic.” These illusions frequently con- 
tribute not a little to misunderstanding, social tension, and 
war. The “cases” of social agencies and especially the char- 
acters and situations portrayed in novels and the movies are 
widely accepted as representative. The novel and the play 
are, in fact, felt by many to be the ultimate source of infor- 
mation about social and psychological matters. They may 
or they may not be. It depends on such criteria as we have 
discussed in’ this chapter, not upon whether the tale “rings 
true.” It may ring true merely to our previously formed 
biases, and if so, our illusions are only confirmed by their 
corroboration from “literature.” Much has been written 
about the mischief which stereotypes play in human rela- 
tions. Perhaps one of the basic attacks upon this problem 
is a more general diffusion of knowledge of the elements of 
valid sampling techniques. 


I. SUGGESTIONS FOR FURTHER STUDY 


1. The best brief supplementary reading on the aspects of 
sampling treated in this chapter will be found in G. U. Yule 
and M. G. Kendall, An Introduction to the Theory of Sta- 
tistics, (Chas. Griffin and Company, London, 1937), Chap. 18 
on “Preliminary Notions on Sampling.” Detailed treat- 
ment of the statistical-mathematical aspects of large and small 
samples will also be found in Chaps. 19-23 of this excellent 
book. The statistics of small samples is perhaps most ade- 
quately treated in R. A. Fisher, Statistical Methods for Re- 
search Workers, Oliver and Boyd, (Edinburgh : 1936). See 
also his The Design of Experiments, Oliver and Boyd, (Edin- 
burgh : 1937). For a more elementary treatment of the sta- 
tistics of sampling, see F. E. Croxton and D. J. Cowden, 
Applied General Statistics, (Prentice-Hall, 1940), Chap. 12. 
Also R. E. Chaddock, Principles and Methods of Statistics, 
(Houghton Mifflin, 1925), Chap. 11. 4 

2. An interesting brief description of the sampling tech 
nique in the United States Census of 1940 will be found in 
F. F. Stephan, W. E. Deming, and Morris H. Hansen, “The 
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Sampling Procedure in the 1940 Population Census,” Jour. 
Amer. Statis. Asso., 35:615-30, 1940. See also F. F. Stephan 
and W. E. Deming, On the Sampling Methods in the 1940 
Population Census, Bureau of the Census, 1941. 

3. Further details on the sampling techniques of the prin- 
cipal public opinion. polls will be found in George Gallup 
and S. F. Rae, The Pulse of Democracy, (Simon and Schus- 
ter, 1940). A briefer treatment of the same subject is Elmo 
Roper, “Sampling Public Opinion,” Jour. Amer. Statis. ASSO., 
35:325-34, 1940. 

4. Descriptions of the sampling problems encountered and 
the techniques adopted in concrete investigations of differ- 
ent kinds will be found in the following sources : 


(a) Erika H. Schoenberg and Mildred Parten, “Meth- 
ods and Problems of Sampling Presented by the Urban 
Study of Consumer Purchases,” Jour. Amer. Statis. 4ss0., 
32:311-22, 1937. 

(b) The National Health Survey, Preliminary Reports : 
Significance, Scope and Method, (Washington: U. S. Pub- 
lic Health Service, 1938). 

(c) Calvert L. Dedrick and M. H. Hansen, “The 
Enumerative Check Census,” Census of Partial Employ- 
ment, Unemployment and Occupations; 1937. (Wash- 
ington, 1938), pp. 159 ff. 

(d) C. F. Sarle, “Development of Partial and Sample 
Census Methods,” Jow. Farm Economics, 21:357-64) 
1939; Irvin Holmes, “Results of Four Methods of Sam- 
pling Individual Farms,” Ibid., 365-74. 

(e) Paul F. Lazarsfeld, The American Station Sampler 
and How to Use It, (Office of Radio Research, Columbia 
University, 1940). 


5. Excellent supplementary reading on the general and 
practical aspects of sampling will be found in two articles 
by F. F. Stephan, “Representative Sampling in Large-Scale 
Surveys,” Jour. Amer. Statis. Asso., $4:343-52, 1939; and 
“Practical Problems of Sampling Procedure,” Amer. Sociol. 
Review, 1:569-80, 1936. See also S. A. Stouffer, “Sociology 
and Sampling,” Fields and Methods of Sociology, Edited by 
L. L. Bernard, (Long and Smith, 1934), Chap. 15. 


CHAPTER VI 


THE SCHEDULE AS AN INSTRUMENT OF 
OBSERVATION 


The next care to be taken in respect of the Senses, is a supplying of their 
infirmities with Instruments, and, as it were, the adding of artificial Organs 
to the natural; this in one of them has been of late years accom lished with 
prodigious benefit to all sorts of useful knowledge, by the invention of ti- 
cal Glasses... By this the Earth itself, which lyes so neer us, under our feet, 
shews quite a new thing to us, and in every little particle of its matter, we 
now behold almost as great a variety of Creatures, as we were able before to 
reckon up in the whole Universe itself, It seems not improbable, but that 
by these helps the subtility of the composition of Bodies, the structure of 
their parts, the various texture of their matter, the instruments and manner 
of their inward motions, and all the other appearances of things, may come 
to be more fully discovered. — Robert Hooke.t 


A. THE NEED OF INSTRUMENTS OF OBSERVATION IN THE 
SOCIAL SCIENCES 


Man is usually credited with being the only tool-making 
and tool-using animal. To this proclivity he owes his dom- 
ination over other animals and over his environment gen- 
erally. Man’s first concern with tools was perhaps with their 
usefulness as extensions of his more overt muscular powers. 
But he soon developed rude devices to guide and extend his 
observations, and to check them objectively. When the sav- 
age put his ear to the ground the better to detect sounds, or, 
shaded his eyes with his hand to aid his vision, he made the 
first rude gestures in the extension of his stimuli receptors, 
the highest form of which we now find in our most powerful 
microscopes, telescopes, and various delicate instruments of 
sound detection. When he gauged the time of day by the 
position of the sun relative to a tree, the passage of longer 
periods of time by the phases of the moon, and distance in 
terms of a day's march, he was beginning to check his sub- 
jective impressions by objective criteria. Man’s progress to- 
ward more delicate and accurate adjustments has consisted 
of the increasing refinement of these mechanical devices for 
extending and objectifying his observations. In physical 

1 Micrographia, 1665. (Preface, pp- 8, 9 Alembic Club Reprints, William 


F. Clay, London and Edinburgh, oad 
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science that refinement has reached the highest point yet at- 
tained in any field. 

The importance of supplementing and extending the 
power of our stimuli receptors or “senses” by mechanical 
devices was stressed in a previous chapter. No fact in the 
history of science stands out more clearly than the close de- 
pendence of scientific advancement on the invention of 
mechanical aids to observation. In the hands of Galileo the 
telescope revolutionized the then existing notions of the uni- 
verse. Recent developments in telescopic, photographic, 
and spectroscopic instruments enable us to make physical 
and chemical analyses of the heavenly bodies. Numerous 
phenomena which for centuries puzzled the ablest minds be- 
cause their instruments of observation were inadequate are 
now well understood. Without the development of the mi- 
croscope, bacteriology could not have developed at all and 
other biological sciences would have been seriously handi- 
capped. Many overlook how completely the imposing edi- 
fice of physical science rests upon the intricate and ingenious 
mechanical devices we have invented to extend and objec- 
tify our powers of observation. Continued advancement in 
our knowledge, furthermore, must depend very largely upon 
the increasing refinement of these devices. 

As was also pointed out in a previous chapter, one of the 
reasons for the low state of development of the social sciences 
is the comparative absence of accurate and intensive observa- 
tion devices. From the rôle which these devices have played 
in the development of the physical sciences, we may assume 
that our first task in the social sciences is the improvement 
of our mechanical aids to observation. Our chief reliance 


so far has been on some form of a schedule. This device is 
an aid to observation in several ways. 


B. THE IMPORTANCE OF THE SCHEDULE AS AN AID TO 
OBSERVATION 


In the first place, the schedule extends enormously the in- 
vestigator's powers of observation. It does this by reminding 
the observer of each item which is to be looked for and 
reported on. The schedule insures that the observer makes 
note of the same items of information in all the cases, and 


THE SCHEDULE 161 


yet makes it unnecessary for him to trust his memory for the 
facts observed. Conclusions based on a large number of ob- 
servations without the assistance of a formal schedule are 
notoriously untrustworthy. An inspection of even a small 
number of industrial establishments, for example, without 
some such schedule as illustrated on page 170 would enable 
the investigator to give only a very vague and inaccurate acy 
count of his observations. In the first place, he would fail 
to observe many of the items called for on the card. In the 
second place, he could not remember the number of cases 
in which a particular item was present or absent or the degree 
to which it was present or absent. The investigator would 
tend to observe and remember only the unique, exceptional, 
or unusual features, or the aspects in which he was especially 
interested. The conflicting reports of visitors and “‘investi- 
gators” of Russia in recent years furnish excellent examples 
of what happens when we attempt to carry on observations 
without the aid of some objectifying device. Visitors to the 
same place at the same time have brought back radically dif- 
ferent reports of what they saw. The notions of almost any 
community on changes in climate, on the comparative se- 
verity of winters, etc., are examples of the same error in gen- 
eralizations which are not based on objective recording of 
the facts. A check of the records of the Weather Bureau will 
show in most cases that general impressions are not supported 
by the facts. It is in the elimination of such errors that the 
schedule serves as an important tool in extending and vali- 
dating the observation of social facts. A 

In the second place, the schedule tends to standardize and 
objectify the observations of different observers. That the 
facts observed shall appear the same to all qualified observ- 
ers, is, as we have seen, one of the primary requirements of 
science. ‘The schedule helps to achieve this result (1) by 
singling out the particular aspects of a situation which are 
regarded as significant for the purpose 17 hand and calling 
attention to these ; and (2) by specifying in advance the units 
and terminology in which the observations are to be de- 
scribed. As we have seen in a previous chapter, in order to 
make a unit mean the same to different persons we objectify 
it in some way by referring it to some external measuring 
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device, such as a scale, a yardstick, etc. That is, we tend to 
describe Our observations quantitatively instead of qualita- 
tively. The importance of quantitative description in science 
has been emphasized in previous chapters. 

Finally, the schedule is a device for isolating one element 
at a time and thus intensifying our observation of it. This 
again is one of the techniques on which physical science has 
depended for much of its advancement. The division of a 
mass of impressions into its constituent factors is the essence 
of all analysis, and only through such a separation of factors 
can sequences be observed arid behavior understood. 

The careful construction and use of schedules, therefore, 
becomes one of our most important problems in the observa- 
tion of social phenomena. 


c. TYPES OF SCHEDULES 


For convenience in discussion we shall divide schedules 
into three classes: (1) Schedules for the recording of objec- 
tive facts (a) from the direct observation of the investigator, 
(e.g., Fig. 2), (b) from interviews with people who possess 
the desired facts (e.g., a birth registration schedule, Fig. 1, or 
the United States Census population schedule, Fig. 2), and (c) 
from “questionnaires” filled out by people without personal 
interview by an investigator ; (2) schedules for the determina- 
tion and measurement of attitudes or opinions, (see Chap. 
VII) and (3) schedules for the scoring of the status and func- 
tioning of social organizations and institutions so that they 
may be compared or measured relative to some standard and 
relative to each other (see Chap. IX). The above categories 
are not mutually exclusive, of course. Many schedules would 
fall into two or more of the classes given. The present chap- 
ter will deal with only the first of these classes. á 


D. SCOPE AND PURPOSE OF THE INVESTIGATION 


Before schedules can be intelligently prepared, the scope, 
plan, and purpose of the investigation must be decided upon. 
Some of the considerations involved in this preliminary work 
were discussed in the preceding chapters in connection with 
the discussion of the working hypothesis and sampling. We 
pointed out in Chapter I that too many projects simply un- 


Registrar's No.—_______—_—. 


O PIRETA STANDARD CERTIFICATE OF LIVE BIRTH So Fie Ne 


WHY BIRTHS SHOULD BE REGISTERED 


There is hardly a relation of life, social, legal, or economic, in which the 
evidence furnished by an accurate registration of births may not prove to be 
of the greatest value, not only to the individual but also to the public at 
large. It is not only an act of civilization to register birth certificates but 
good business, for they are frequently used in many practical ways: 


(1) As evidence to prove the age and legitimacy of heirs ; 

(2) As proof of age to determine the validity of a contract entered into 
by an alleged minor ; 

(3) As evidence to establish age and proof of citizenship and descent in 
order to vote ; 

(4) As evidence to establish the right of admission to the professions and 
to many public offices ; 

(5) As evidence of legal age to marry ; 

(6) As evidence to prove the claims of widows and orphans under the 
widows’ and orphans’ pension law ; 

(7) As evidence to determine the liability of parents for the debts of a 
minor ; 

(8) As evidence in the administration of estates, the settlement of insur 
ance and pensions ; 

(9} As evidence to prove the irresponsibility of children under legal age 
for crime and misdemeanor, and various other matters in the crimi- 
nal code; 

(10) As evidence in the enforcement of law relating to education and to 
child labor ; 
(11) As evidence to determine the relations of guardians and wards ; 
(12) As proof of citizenship in order to obtain a passport ; 
(1g) As evidence in the claim for exemption from or the right to jury 
and military service. 
Ficurr 1. A Simple Type of Schedule. 


— 


164 SOCIAL RESEARCH 


” 66 


dertake to study “unemployment,” ‘‘women in industry,” 
“relief,” “crime,” etc., without specifying clearly the exact 
aspects of these phenomena to be studied. Accordingly, 
schedules are hastily made up to consist of any or all ques- 
tions occurring to anyone as being “interesting.” Too many 
schedules are mere inventories of what someone would like 
to know. They fail to take into consideration the problem 
of what we may reasonably hope to get, and the best way of 
getting it. Inexperienced investigators are especially prone 
to rush into the field before the purpose of the research has 
been carefully thought through. Their schedules will be 
correspondingly vague and ambiguous. Social and economic 
Subjects have an indefinitely large number of aspects which 
may be investigated. A choice must be made. We may set 
it down as a first rule not to attempt the preparation of a 
schedule for an investigation until the purposes of the in- 
quiry have been reduced to specific questions. This does not 
mean that general information may not be gathered on as 
general subjects as is desired. It merely means that what- 
ever is desired should be explicitly stated. 

When it has been definitely decided what questions are 
needed, the next step is a careful consideration of just what 
data, when gathered and analyzed, would yield the answers 
sought. The availability of these data within the time and 
other resources of the investigation must then be considered. 
It may be known from the outset: (a) that some of the re- 
quired data are certainly available, (b) that certain other nec- 
essary data definitely are not available, and (c) that other data 
are of doubtful availability. If the questions call for un- 
available data, this fact had better be recognized in advance. 

therwise the time of the investigators will be wasted on a 
vain quest. When availability is in doubt, the point should 
be cleared up as far as possible by preliminary inquiry. 
it is certain the data will not be available to settle the ques- 
tions as originally propounded, it may be possible to rephrase 
them so that the original purposes can be achieved in whole 
or part through indirect or inferential channels. Historians 
and anthropologists, not to mention students of cosmic and 
terrestrial evolution, frequently cannot hope to find direct 
or concrete accounts of many of the events of the indefinite 
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past. But through study of contemporary data they can 
make fairly reliable deductions regarding what did occur, 
because of what we know to be probable under certain cir- 
cumstances. Data from which such inferences can be drawn 
are, of course, just as proper objects of inquiry as direct cen- 
sus information about birth rates, death rates, or unemploy- 
ment. Except in purely exploratory investigations it serves 
no purpose to’ proceed with an investigation*until the avail- 
ability of the data has been determined with some certainty. 
Hypotheses couched in such terms as to be unsusceptible of 
empirical test are metaphysical rather than scientific hy- 
potheses. ‘ 

Secrist ? has summarized this preliminary work in the form 
of nine questions which should be answered before the col- 
lection of data begins. While these points contemplate a sta- 
tistical investigation, they would apply with equal validity 
to any other form of inquiry : 


"1. What is the precise problem upon which statistics are re- 


quired ? i H, 
2. Does the problem as formulated lend itself to statistical 


treatment ? : : s 
3- What types of data are necessary for its analysis or solution ? 
4. Are they likely to be available in suitable form? 

5. Are they likely to be adequate for the purpose in mind? 
6. Will they have the required degree of accuracy, consistency, 


and comparability ? ai , Ges 
7. Can the data be made available within the time limit re- 


quired ; that is, have they the required currency ? 
8. Are there likely to be any restrictions upon the use of data. 


which will compromise the purpose which they are to serve? 
9. What sanction is necessary, and what method of procedure, 
with the sanction available, must be followed in order to secure 


the desired facts? 
E. THE CONSTRUCTION OF SCHEDULES 


Since the value of the results of an investigation will fre- 
quently be determined by the care with which the schedule 
is constructed, it is advisable to prepare it with great care. 
This is especially true in the case of the questionnaire. Some 
special considerations concerning this type of schedule will be 


2 Horace Secrist, An Introduction to Statistical Methods, (Revised edition, 
Macmillan, 1925), p. 47- 


ane SOCIAL RESEARCH 


considered in the next chapter. But even the enumeration 
schedule deserves careful construction from the standpoint 
of uniformity and reliability of the résults it will bring in, 
as well asefrom the standpoint of clearness to tabulators and 
interpreters of the results. The principal aspects of schedule 
making that demand attention, and which apply to all the 
forms mentioned above, may be summarized as follows : 


1. Items to include ” 


To determine what items to include in a schedule, a care- 
ful preli ry study of the problem is necessary. The 
temptation is to include all items whith come to mind as 
being “interesting” or “possibly of value.” But this tendency 
has to be avoided because in a comprehensive study, a single 
aie p may cost thousands of dollars in the addi- 
tional a t of time and space required for recording and 
the extra work involved in editing and tabulating. The 
principal rule should be: Never include an item without 
first having a clear idea of exactly how the information is to 
be utilized and what it will contribute to the purpose sought 
in the investigation. If its importance from this point of 
view is low compared with the additional cost of including 


- it, the item should be discarded. 


~~ 


There are, however, questions and items which, although 
they might be of considerable value, arouse antagonism on 
the part of the informant. These it may be wise to omit. ` 
This is especially true when the investigator is dependent 
on the cooperation of the informant for his information. 
Many people are inclined to be reticent on questions of men- 
tal ‘physical infirmities, marital and sexual relations, the 
use of intoxicants, etc., and attempts to secure information 
on such matters will usually arouse antagonism, Unless the 
purpose of the investigation distinctly calls for information 
on these topics, they had better be avoided, for indelicate 
inquiries on them may seriously jeopardize the more im- 
portant information we want to secure, These problems 
may be met in part by careful consideration of the wording 
of the inquiry. 

2. Dummy Tables 

One of the best ways of making sure that the schedule 
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will call for the pertinent data required by the investigation 
is to prepare in advance imaginary dummy or “ghost” tables. 


A dummy table is one which, if properly filled in, would 


answer the questions we want answered. -With these skele- 
ton tables before us, the next question is, what specific in- 
quiries in the schedule would yield data of the type and in 
the form necessary to construct such tables?_ By constantly 
keeping in mind the desired result of the investigation, most 
of the problems of inclusion or exclusion can be settled on 
this practical basis. i 


F 


3- Wording 

Headings and titles should be sufficiently complete to in- 
dicate fully the kind of data contained under them, and suf- 
ficiently clear to be understood by a person o inary in- 
telligence. Words and phrases of ambiguous meaning should 
be avoided. The enumerator, at least, should understand 
exactly what interpretation each item calls for. For exam- 
ple, if the item is “age,” this is subject to at least three inter- 
pretations : (1) exact present age, (2) age at last birthday, (3) 
age at nearest birthday. Accuracy and uniformity demand 
that this item be defined in the same way in every case, and 
the schedule should make the interpretation clear. When 
the schedules are to be filled out by skilled field workers, 
they may be given uniform instructions and interpretations 
in advance? This will make unnecessary such rigid specifica- 
tions in the schedules as is necessary when they are to be 
used by unskilled workers. In some cases the necessary in- 
terpretations may be so complex as to require more pagre 
instructions than can be printed on the face of the sch ule. 
Thus the Federal Census enumerators are equipped with a 
separate manual of instructions that-aims to cover in full all 
questions of interpretation of each item. Consider, for ex- 
ample, the following extracts from the instructions to enu- 
merators in connection with the recording of occupations : 


544. The term “laborer” should be avoided if any more precise 
statement of the occupation can be secured. Employees in fac- 
tories and mills, for example, usually have some definite designa- 
tion, as weaver, roller, etc. Where the term “laborer” is used, be 
especially careful to state accurately the industry or business in 
col. 29. 


s 


4 


*% «i steel mill, 


n 
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545. Avoid the use of the word “mechanic” whenever‘a more 
specific occupation can be given, such as carpenter, painter, elec- 
, trician; etċ. Automobile mechanic,’ however, is a satisfactory 
‘return. y & PA. e ) 

546. Avoid the use of the word “clerk” wherever a more defi- 
nite occupation can be named. Thus, an employee in a store who 
is wholly or principally engaged in sélling goods should be called 
a salesman and not a “clerk.” A typist, accountant, bookkeeper, 
cashier, etc., should be reported as such, and notvas a “clerk.” Do 
not return, a stenographer as a “secretary.” Distinguish a travel- 
ing salesman from a salesman in a store. . . i Bee. 

50. Fi employee who works for a concern that carries on 
different ies, the return should be the industry in which he 
directly works, provided that represents a major subdivision of 
the enterprise and is carried on in a separate building or in a 
pace physically distinct from other activities of the concern. 
us, a pinge orking in a coal mine, owned and o erated bya 

ld be returned as engaged in the coal mining im- 

dustry (that is, in his immediate place of work) and not as work 


_ ing for a steel mill. On the other hand, persons, working in a 


a unt or other unit incidental to the main work of an es — 
y tabli 


i 
f 


1 


€ 


a 


"5 


e 


ishment should be returned in the Industry column (col. 29) 
as engaged in the major activity of the establishmént. For €x- 
ample, ne industry return for workers in the power plant ofa 
steel rolling mill should be steel rolling mill; that for workers 
in the box making department of an electrical machinery factory 
should be electrical machinery factory ; that for workers in the 
garage or the warehouse of a department store should be depart 
ment store; and that for workers in the foundry of an agricul- 
noa implement factory should be agricultural implement fac 
552. Avoid General or Indefinite Terms. — Give the occupt 


tion and industry precisely. For example, return a-worker ip a 


Pa ie as a spinner, cotton mill; weigher, woolen mill; 
spoo 


» silk mill; etc. Do not report industry in such indefinite 
terms as refinery, transportation, electrical, etc, but specif 


petroleum refinery, sugar refinery, or copper refinery ; steam raik 
road, or bus line; electrical PHAR Ta sed ja pi: ctrical appli 
ance factory, etc.» Never enter in col. 29 such indefinite terms a$ 
factory,” “mill, ‘shop,” “store,” or “office,” without stating the 


kind of factory, etc, as soap fact tt ill, auto repay 
shop, grocery store, real pore d eer ake 

Whenever possible it is advisable to have the wording of 
the schedule itself make clear the exact meaning of the item; 


Bs SE g Census of the United States, 1940, Instructions to Enumeralor® 
. 61-63. 
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When this is not possible, the type of instructions given above 
is desirable. 

One of the commonest faults in the wording of schedules 
is the subjective type of answer an item suggests or allows. 
It is objectionable, for example, to have items in a housing 
schedule on heat, light, or ventilation checked with such 
subjective and relative terms as “good,” “fair,” “bad.” It is 
preferable for the schedule to specify objectively all the gen- 
eral types of heat such as, steam, hot water, gas, oil, elec- 
tricity, coal stove, etc., and to check the type which applies. 
Lighting may likewise be classified as natural and artificial, 
and it may be further sub-classified as direct or indirect. If 
more accurate information is desired, the relation of window 
space to floor space, the number of lamps and the power, etc., 
may be specified. In this way the investigations of different 
workers will be strictly comparable and subject to quantita- 
tive measurement, which would be far from true of reports 
in terms of “good,” “fair,” “bad,” etc. Whenever possible 
the questions or items should be so formulated as to permit 
the recording of the answer or the observation with a check 
mark, a cross, crossing out or underscoring words, a number, 
or “Yes,” or “No? The sample schedule (Fig. 3) repro- 
duced in this chapter is defective in this respect in several 
instances. (See last column, top, especially.) Leading ques- 
tions and questions requiring computations on the part of 
the informant should be avoided. We shall return to this 
subject in the next chapter. i 

The unrevised questions in the accompanying illustration 
(Fig. 4) further illustrate the failure to apply some of the 
principles set forth above. These questions were selected 
from a series of schedules on church and community affairs.“ 
The indefinite, qualitative, and subjective nature of the an- 
swers which they call for, or at least permit, would make 
the results of an investigation conducted with such a sched- 
ule largely worthless. % ; È 

The revision which follows each original question was 
formulated by a group of students as a laboratory exercise 
in the application of some of the principles concerning the 


4F. H. F. Ritchie, How to Study Your Association and the Community, 
Association Press, New York, 1926). 
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Ficure 3. Schedule for the Investigation of an Industrial 
Establishment. 


construction of schedules as discussed above. While these 
revisions are subject to further improvement, and while their 
form in many cases would be governed to some extent by the 


particular church, they represent a vast improvement over 
the original questions. 


4. Physical Aspects of Schedule” 


(a) Size and es i pe it is desirable to adopt a size 
and shape which fits some standard filing cabinet, €g» 
x 8 or 814x 11. A size which does not require foldin 
1s generally preferable, when the purpose can be serv 


5For further discussion of this subject as regards mail questionnaires, sec 
Chap. VII. 
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by a relatively brief schedule. There are cases where 
larger sheets are used to good advantage, however. For 
example, the Federal Census of Population (1940) em- 
ploys sheets 1814 x 24, with items on both sides. Each 
sheet has room for the recording of the data on 80 per- 
sons. The size of the schedule will be determined 
chiefly by the number of items it contains and the uses 
or methods of handling planned for the schedules when 
filled out. 

(b) Material and color.—The grade or kind of paper to be 
employed should be governed by the degree of perma- 
nency and the amount of handling expected for the 
schedules when filled out. In any case, the paper should 
take ink easily and be of such grade as not to tear easily 
or cause other trouble to the enumerator. For some 
purposes cards are very desirable. They are easy to 
write on when held in the hands of the enumerator, 
and if the schedules are planned for ready reference in 
a card file, their rigidity greatly facilitates locating, han- 
dling, sorting. On the other hand, they are more ex- 
ae and occupy more space than thinner and more 

exible paper. Color is of importance chiefly as a 
classification device. 

(c) Arrangement of items. — The arrangement of thesitems 
on the schedule is of Topora from four points of 
view: (1) The items should be so grouped and spaced as 
to facilitate the ease with which the eye follows a line or 
column, or is able to locate the part and subject-matter 
desired. Titles, headings, and sub-headings are important 
in this connection. The relative size of the orn and the 
position of the headings should indicate the re tains 
of different parts of the schedule. (2) The items should 
so grouped that related items, or items generally called 


rangement may be a great convenience, if the schedules 
are used for reference purposes, and may save much time 
in editing and tabulation, if they are to be handled 
statiacically. (3) Each item should be numbered or let- 
tered in a distinctive way so that it may readily be iden- 
tified and compared with the same item in other sched- 
ules. Each schedule likewise should bear sufficient 
identification marks to it its Ee or in the 
proper place if it should filed out of order or become 
ted from the filing place of the other schedules. 
is identification may be in the form of numbers, 
names, dates, or titles, (4) Undue crowding of items 
should be avoided. 
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FIGURE 4. — EXAMPLES OF DESIRABLE REVISIONS OF QUESTIONS 
APPEARING IN A QUESTIONNAIRE ON CHURCH AND COMMUNITY 


AFFAIRS 
I. Do you go to church regularly?..... EAN ‘ 
"Revision: ti 
(a) Do you go to church at all? No........ ae 


(b) On which of the following special holidays have you at- 
tended church within the past year : 


1. New Year'sy,.. 2. Good Friday.... 3. Easter.... 
4. Christmas............ EOI T e A RIALAK. 


(c) Indicate your attendance at the following : 


? Number of Times Attended | Past Month| Past Year 


1. Sunday morning service .|........0.)eeceeee eee 
2. Sunday evening service 


3. Mid-Week service 


II. Do you contribute to any worthy causes ? 
Revision : 


Indicate the approximate amount of your contribution to 
the following causes : 


Salvation Army 
s Red iomat: viutaien tne 


> 


Ill. 


IV. 
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Are you active in church WOTK? sobs. soe 
Revision: 
Check the church activities you have been engaged in dur- 
ing the past year: 
(a) Sabbath School teacher........++++-+- 
(b) Officer of the Sabbath Schoali.cities'a 
(c) Church official. .... +++ +seeereeeerees 
1. Elder.......5:. 
2. Trustee......++ 
3. Deacon.....--- 
4. Others........- 8; 
(d) Choir......+-+++- 
(e) Officer of or worker in other church activities 
1. Boys’ club work.....-+++++ 
2. Girls’ club work.....+-+++ 
g. Church socials. ....++++++- 
4 Others.. esere ennet. eden 


To what extent has this series of meetings been interesting ? 


Revision : 
1. How many of this series of meetings did you attend? 


. How happy in their work do the men seem to be? 


Revision : 
1. Total number of men employed....--++++++++ 
2. Total number of men employed 
(a) 5 years OF MOTE...+--++++eeees 
3 9 tO p YEATSi.-seeseee reer eee es 
c) 1 tog years .--- eee eeerreeeres 
d) Less than 1 year... sser 
(e) Less than 3 months.. s.-s... 
3. Number of complaints within past year of 
(a) Working hours ...--++++++++++ 


b) Wages seee ET nis 
c) Other conditions. . ++ -+++ «r17 


VI. To what extent has newspaper publicity been used? 


Revision: pe 
1. Total number of square inches of newspaper publicity 
received last year... -s.t t" pei 

ot 


Sq. inches Annual cost circulation 


(a) Paid Advertisement .-.----+ eetet e 
(b) News notes A ENEE E TIN 
(c) Other free publicity +.. eetere eee 
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VII. Is the room well lighted? 


Revision: 
1. Natural light : 
(a) Total sq. ft. window space......-.++++++ 
(b) Total sq. ft. skylight.........+..+-0++5 
(c) Total sq. ft. floor space.....-.-.++..++- 
(d) Total sq. ft. window space, opening on other rooms or 


OHAPOIChs.. 3. eave «ns ‘patie 
(e) Height (ft.) Distance (ft.) of next building. 
INGLE Dec tobi sss humate om hss + 
HOULBOedictastsews einen wee es 
LET hee Gera aaeeio d ep 
West seseseese. CEES 
2. Artificial light : 
aj No. of oil lamps............ 
b) No. of lapsenes sa esek? 
c) No. of electric lamps. ...... (d) Total wattage...» - 


In summary, we may say that the construction of a sched- 
ule involves attention to four distinct steps: (1) A list of 
questions to which answers are sought; (2) A set of dummy 
tables which, if supplied with the proper data, would answer 
these questions; (3) The arrangement of the questions or 
items in the schedule, with attention to physical appearance, 
wording, logical order, etc. ; (4) Instructions to enumerators. 

The most frequently neglected of these steps is the second, 
namely, adequate foresight as to exactly how the questions 
included in the schedule are to be tabulated so as to yield 
the answers sought. The subject of tabulation is a large and 
important one, and must be left to textbooks on statistics.° 
It is a subject intimately related, however, to the construc- 
tion of schedules, and attention should therefore be called to 
this fact in the present connection. If standard Hollerith or 
I.B.M. electric card-sorting and counting machines are to be 
used, as in ais sei, involving thousands of sched- 
ules and many interrelations, code-symbols for each possible 
type of answer should be inserted in the schedule, thus 
greatly facilitating the editing and card-punching necessary 
for machine tabulation. The standard machine punch card 

6A brief treatise including illustrations of machine methods, will be 
found in F. E. Croxton and D. J. Cowden, Applied General Statistics, (Prentice- 


Hall, 1940), ch. 2. See also, Katherine Phelps, “A Flexible Method of Hand 
Tabulation,” Jour. Marketing, Jan., 1939, p- 265. 
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contains eighty columns and twelve rows. Each possible an- 
swer is assigned a code-symbol corresponding to a particular 
column and row on the card. Each card is then punched 
to correspond to the answers on a particular schedule. 


F. THE TESTING OF SCHEDULES 


Even with the most careful preliminary preparation and 
criticism, most schedules have defects which only actual use 
in the field will reveal. For this reason it is always advisable 
to try out a new schedule in the field, and to revise it on the 
basis of the discovered defects, before going ahead with it on 
the whole investigation. This will usually result in the cor- 
rection of errors which would have impaired seriously the 
value of the study. We shall discuss this subject more fully 
in the next chapter. 


G. THE FILLING OUT OF SCHEDULES 


The purpose of many of the principles of schedule-making 
which have been discussed above is to insure that they will 
be intelligibly filled out. One of the principal points we 
have emphasized is that each item should be unmistakable 
as to what information is wanted and in what form the an- 
swer is desired. In short, one of the chief considerations in 
constructing a schedule is to make it as nearly fool-proof as 
possible. But the greatest possible care in the wording of 
the schedule will not eliminate the need for care in filling 
it out. Unfortunately, no schedule has yet been devised 
which will automatically record the facts we want observed 
and recorded. It requires intelligent treatment on the part 
of the investigator or the informant, as the case may be. It 
is an unfortunate fact that in addition to the inadequacy of 
our public and quasi-public records from the standpoint of 
structure and completeness, the pra ical value of these rec- 
ords is further vitiated by ignorance d carelessness in fill- 
ing them out. s 

In many cases, carelessness in filling out schedules is due 
to the fact that the person charged with this duty has no 
knowledge of the significance of the data he is asked to pro- 
vide or of what possible usefulness they may be when com- 
piled. As a result, such persons are usually indifferent, and 
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frequently hostile, to what appears to them a futile task. 
One frequently finds this attitude among public officials, 
teachers, and social workers. To the active worker, record- 
keeping is often “uninteresting,” and since each record be- 
comes merged with thousands of others, and perhaps is sum- 
marized in a cold average of some kind, many officials and 
workers are bored with the task of filling out schedules. 
With this attitude prevalent, it is not surprising that even 
in states where the law provides fairly adequate schedules 
for recording vital data, there is the most shocking indiffer- 
ence and incompetence evident in record-keeping. The 
general defense is that no one except professors, researchers, 
and other snoopers ever read the records anyway, and it is 
» therefore felt that it is useless to take any time or trouble in 
' making them intelligible. 

Almost any collection of a large number of schedules that 
have been filled out by a number of comparatively unskilled 
people contain a large number of pathetic and humorous 
examples of ignorance and incompetency. Whipple cites the 
following illustrations (some of them perhaps intentionally 


absurd) of the recording of the causes of death sometimes 
encountered : 7 


“Went to bed feeling well, but woke up dead.” 
“Died suddenly, nothing serious.” 
“Deceased had never been fatally sick.” 


“Last illness caused by chronic rheumatism but was cured be- 
fore death.” 


“Deceased died from blood poison, caused by a broken ankle, 


which is remarkable, as the automobile struck him between the 
lamp and the radiator.” 


“Typhoid fever, bronchitis, pneumonia and a miscarriage.” 
“Death caused by five doctors.” 
“Vital Statistics.” 


The above illustrations are doubtless extreme cases. Less 
absurd but no less serious errors may easily be found in most 
collections of records. The most common defects in filling 
out schedules may be summarized as follows: 

1. Defects in legibility.—If a word, letter, numeral or 
symbol cannot be read at all, or may be mistaken for some 
other symbol, this destroys the value of the record. While 


TG. C. Whipple, Vital Statistics (Second edition, Wiley, 1923), p. 349- 
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the wording of the schedule may eliminate as far as possible 
the need for writing, it cannot guard against illegibility of 
marks or symbols made by the field worker. Reasonable 
care alone can prevent this fault. 

2. Inadequacy or incompleteness of the matter filled in. 
— Sometimes the investigator, in filling out his schedule, will 
use abbreviations which are intelligible only to himself, and 
to himself only temporarily. This may make the record use- 
less in many cases. If other than standard abbreviations must 
be used, a key should be provided if their meaning is at all 
doubtful. Where the schedule requires the names of per- 
sons for identification or other reasons, confusion is often 
caused by incompleteness. Designations such as “Dr. Smith,” 
“Mr. Jones,” etc., are usually nearly worthless. The first, 
name and an initial, if possible, should always be secured. 
Care should also be exercised in securing the exact spelling 
of names and addresses. When dates are called for on the 
schedule, care in securing accuracy should be observed. 
They may become the basis of legal, financial, or other con- 
siderations of the greatest importance. 

3. Check marks and empty spaces. —_In many cases, a,check 
mark in a space of a schedule may have at least three mean- 
ings: “Yes,” “No,” or “Unknown.” Usually this ambiguity 
can be avoided by the wording of the schedule, or by spe- 
cific instructions. Thus a schedule which inquires into 
marital status in this way : “Married? . . .” will perhaps ordi- 
narily get an answer of “no” or “yes.” But frequently it will 
be answered by a check mark to indicate that the informant 
is married, or by a dash to indicate that he is not. Where 
single words are used as questions in this way, the safer way 
is to enumerate the alternatives and indicate that a check 
mark is desired: Thus, 


Single seese eetet 
Widowed .... -eese 
Divorced .. -ereet 
Separated .. -+t 


This form is also preferable to the form which has a series 
of words, calling either for the underscoring of the one it 
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is desired to indicate, or for the crossing out of all except 
the one it is desired to indicate. Lack of uniformity in 
designation, which results from the presence of such alterna- 
tives, causes confusion and misinterpretation. Spaces that 
cannot be filled for lack of information, or because they be- 
come irrelevant on account of a certain answer to a previous 
question, should not be filled with a check mark or other 
symbols, for they may lead to misinterpretation. A vacant 
space should be filled with a horizontal line to indicate that 
the space was not left vacant through oversight, and also to 
prevent subsequent insertions of data by someone else for 
purposes of corrupting the report. It also assists the inform- 
ant or the investigator in looking over his schedule for omis- 

pp sions, and gives assurance to the editors or users of the report 

~ that the omission was due to absence of information and not 
to oversight. The absence of information in a schedule may 
raise questions of good faith either on the part of the inform- 
ant or on the part of the registrar, and if the space has been 
filled with a line the defense of oversight cannot be so easily 
raised. For all of these reasons, therefore, it is generally de- 
sirable to fill all unused Spaces with a horizontal line extend- 
ing the full length of the Space. 

The final test of desirable ways of filling out a schedule 
should be: Do the marks convey the desired information un- 
mistakably ? Through a little care and the observation of 
the simple rules stated above, the accuracy and usefulness of 


information contained on schedules could be greatly in- 
creased. 


H. ADVANTAGES OF PERSONAL INVESTIGATION 


All of the types of schedule discussed in this chapter as- 
sume, first, that only the more objective factual data are 
sought. Secondly, it is assumed that these schedules will be 
filled out (a) by a more or less skilled field worker who either 
records the data from direct observation or gets it first-hand 
from interviews with other people ; or (b) by the informant 
without, personal contact with the investigator. As has al- 
ready been noted, the former type of investigation does not 
Tequire that a schedule should be as detailed in its instruc- 
tion$'for filling out, nor indeed as fool-proof generally, as the 
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questionnaire type which is sent out or distributed for indi- 
viduals themselves to fill out. When the extent of the in- 
quiry, the time, and cost permit, the personal investigation is 
usually preferable and has many advantages. First, it per- 
mits the use of more complex and more numerous ques- 
tions. When the schedule is filled out by a skilled investiga- 
tor complex questions can be explained in whatever terms 
are necessary to elicit the true answer. Several related and 
corroboratory questions may be asked orally to secure a single 
answer. Second, a field investigator can be trained so as to 
avoid the errors to which the inexperienced or untrained 
person is subject. Field workers can also be selected on the 
basis of special advantages of personality and address which . 
will secure responses from individuals who could not bel? 
reached at all by less personal approaches, such as letters. 
These special qualifications of field workers will be consid- 
ered in a later chapter. Finally, more complete returns can 
usually be secured through personal investigation. 

The advantages of personal investigation are perhaps pres- 
ent in the highest degree where the entire study is performed 
by a single investigator, provided he is animated by scientific 
ideals. This usually insures uniformity of approach and in- 
terpretation throughout the study as well as the steadying 
influence of sole responsibility for the accuracy of the re- 
sults. In many cases it may also mean that the study profits 
from the enthusiasm and care of a person deeply interested 
in the results, Of course, there may also be danger in this 
personal interest. A person may become so enamored of 
his hypothesis as to render him incapable of impartial and 
objective E rieti But even this is a failing which is 
least likely to be present in a person of scientific training and 
ideals. Most field investigations will, of course, require a 
number of interviewers. 

A great many of the data of sociology, furthermore, have to 
be secured at present through less ideal methods than per- 
sonal investigation by a trained scientist in the field. In such 
cases, reliable results may still be secured if care is exercised 
in impersonal approaches. Many important social data are 
of a less concrete type than those contemplated in the present 
chapter. We turn our attention in a subsequent chapter to 
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methods of securing and interpreting the less tangible social 
data, such as opinions and attitudes. 


I. CONCLUSION 


We have considered in this chapter the schedule in gen- 
eral as an instrument for aiding the senses in social observa- 
tions. Our attention has been devoted chiefly to the types 
of schedules which assume (a) that a skilled investigator will 
use them, mainly as convenient aids to his memory, and (b) 
that the data called for are of a relatively objective, factual 
sort. Sometimes, however, we wish to prepare schedules 
which, in addition to the above purposes, will serve as a set 
of stimuli to elicit the cooperation of the person from whom 

„information is sought, and as a reliable index to that person’s 
feelings, attitudes and probable behavior in stated circum- 
stances. These purposes are also sometimes served, of course, 
by the types of schedules discussed in the present chapter. 
More frequently the questionnaire designed for impersonal 
distribution is used in the latter type of inquiry. We turn, 


therefore, to a consideration of the questionnaire as a special 
kind of schedule. 


J- SUGGESTIONS FOR FURTHER STUDY 


The best supplementary material for this chapter is the 
examination of large numbers of schedules and their ap- 
praisal from the point of view of the principles discussed 
above. For this purpose the instructor will do well to collect 
examples of the blanks, forms, registration cards, etc., used 
by local, state, and national governments. In addition to 
the various schedules used by the Bureau of the Census, the 
Bureau of Labor Statistics, and the Department of Health 
will provide examples of excellent schedules. As a matter of 
contrast, a collection of the schedules used in local agencies 
will usually provide plenty of material for discussion and im- 
provement. 

Good examples of schedules that have been developed for 
field use in special studies will be found in the publications 
of the Works Progress Administration, Division of Social Re- 


THE SCHEDULE 181 


search, See, for example, Urban Workers on Relief, Re- 
search Monograph IV (1936), p. 104 Part I; Part-Time 
Farming in the Southeast, Research Monograph IX, 1937, 
Appendix E; Rural Migration in the United States, Re- 
search Monograph XIX, 1939, pp- 179-733 Migrant Fami- 
lies, Research Monograph XVIII, 1938, Appendix C. 

A very good chapter on the general subject of schedule- 
making will be found in Lyndon O. Brown, Market Research 
and Analysis, (Ronald Press, 1937): 

Other types of schedules will be referred to in the next 
two chapters. 


CHAPTER VII 


å THE QUESTIONNAIRE 
b 

Have you ever noticed this: That people never answer what you say? 
They answer what you mean, or what they think you mean. Suppose one 
lady nia to another in a country house: ‘Is anybody staying with you?” 
The lady does not answer; ‘Yes the butler, the three footmen, the parlor 
maid,’ and so on, though the parlor maid may be in the room, or the butler 
behind her chair. She says: ‘There is nobody staying with us,’ meaning no- 
body of the sort you mean. But suppose a doctor inquiring into an epidemic 
asks, “Who is abo in the house?’ then the lady will remember the butler, 
the parlor maid, and the rest. All language is used like that ; you never get 
n "cesar pemarated literally, even when you get it answered truly.—G. K. 

esterton. 


` A., THE QUESTIONNAIRE AS A SPECIAL TYPE OF SCHEDULE 


The principles that were set forth in the preceding chapter 
regarding schedules are also in general applicable to the ques- 
tionnaire. There are, however, a number of additional con- 
siderations which should be kept in mind in the construction 
of questionnaires, especially when they are to be filled out 
without the direct personal solicitation and aid of an inves- 
tigator. Many data are at present collected by this method in 
the social sciences, to say nothing of the information secured 
in this way by governmental and private agencies and busi- 
ness concerns, 

Among the reasons for its extensive use is the possibility 
of carrying out this kind of inquiry with a small staff and 
consequently, if the results are satisfactory, at a low cost. 
The latter advantage varies considerably in different types of 
inquiry. In any case, cost can be appraised only in relation 
to the comparative adequacy and reliability of the results se- 
cured? Another consideration in favor of the mail ques- 
tionnaire is its impersonal nature. It avoids the variety of 
possibly irrelevant stimuli involved in personal contact. It 
also preserves the anonymity of the informant in inquiries 
where this is desirable and may be an important condition 

i F 3 a 
Tie cs Ua hak Ak AE M 

ting Rev., 4:5, 193) 


2 The advantage of low cost has been denied iti foot- 
note 54, me ad lenied by some authorities. See 
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for securing his cooperation. As one of the more common 
instruments of social inquiry, the principles governing the 
construction and testing of questionnaires and the interpre- 
tation of results secured from their use, therefore, is a sub- 
ject of considerable importance. "a 
Fundamentally, the questionnaire is a set of stimuli to 
which literate people are exposed in order to observe their 
verbal behavior under these stimuli. Incidentally we may 
also observe some of the other behavior of the informants, 
such as the frequency with which they will return the ques- 
tionnaire under stated conditions, and how this behavior 
and, of course, their replies, correlate with other character- 
istics of the respondents. As to subject matter, question- 
naires may be divided for our purpose into two large 
categories : (1) Questionnaires of fact, which merely ask for in- 
formation subject to corroboration regardless of the inform- 
ant’s opinions or attitudes, e.g., income tax blanks. (2) Ques- 
tionnaires of opinion, and attitude, in which the informant’s 
preferences are the basic data sought. We shall be concerned 
in this chapter with both of these kinds of schedules with re- 
gard to principles of construction, distribution, and securing 
the returns, We shall also assume for the most part that the 
questionnaire is to be answered by mail. Some special con- 
siderations governing the attitude questionnaire and the 
more elaborate tests and scales are reserved for the next chap- 
ter. * 


B. THE SOLICITATION OF COOPERATION pP. 


The request to fill out a questionnaire usually should be 
in the form'of a personal letter in order to take advantage of 
the already established habit of answering this kind of mail. 
The writing of persuasive letters is an art that has received 
much attention from sales organizations and others. The 
same general principles apply to letters soliciting informa- 
tion by questionnaire. The first thing that attracts atten- 
tion is the stationery and the letter-head. These features of 
a letter correspond perhaps to personal appearance 1n an 1n- 
terview. The recipient of a letter draws certain immediate 
conclusions as to the character, respectability, and impor- 
tance of the investigation on the basis of this first impression. 
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The letter-head also provides a desirable way of saying at 
the outset some things that the recipient very much wants to 
know even before reading the letter, and which will influ- 
ence the attitude with which he reads it. These are the ques- 
tions of “Who is approaching me?” and “For what purpose ?” 
The letter-head of a university, research or civic organiza- 
tion with prestige in the area to be investigated, perhaps 
with a subtitle indicating the purpose of the inquiry, to- 
gether with the names of prominent individuals sponsoring 
the project, answers these important questions. The sta- 
tionery and the letter-head, therefore, may determine at the 
outset whether the recipient gives further attention to the 
letter or throws it and the questionnaire away.’ 

-~ The nature of the appeal to be made in the letter is the 
next problem. The recipient must be motivated and pro- 
vided with an incentive that will overcome the reluctance 
most people feel toward writing according to rules, not to 
mention the fact that filling a questionnaire takes time which 
might otherwise be spent on pleasanter activities. Sletto ex- 
perimented, in a carefully controlled study, with three types 
of letters requesting 1600 former university students to an- 
swer a long questionnaire. His procedure and results are of 
interest : 

“The first letter called upon the individual to help im- 
prove education for the thousands of young people who 
would be entering the University during the year. The sec- 
ond letter directed attention to the changes occurring in edu- 
cation and requested help to guide these changes ‘in the right 
direction.’ The third letter challenged the recipient to help 
do something that people say ‘can’t be done.’ The letter ex- 
plained that many people believed a study like this could not 
succeed because former university students are too much 
concerned with their personal affairs or too uninterested in 
education to respond to a lengthy questionnaire... The 
first letter, containing the altruistic appeal to help improve 
education for the thousands of young people entering the 
University, yielded the highest returns, amounting to 67 per- 
Pisin: Watson (“Investigations by Mail,” Market Research, 7:11-17, 1937)» 


in an experimental study no significant differences in number of re- 


turns from printed as compared with multigraphed letter-heads, but found 
that special fletter-heads rather than a mere address was helpful. 
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cent. The second letter yielded 64 percent, and the third 
letter 6o percent. The first letter was, therefore, adopted 
for use when the final questionnaire was sent to the final 
group. The differences in the proportion of returns yielded 
by the three letters are not large enough to yield critical 
ratios that are statistically significant according to conven- 
tional standards. The superiority of the first letter in pro- 
ducing returns was, nevertheless, consistently maintained 
throughout the time period of the returns. Since the pre- 
testing of questionnaires will normally involve small num- 
bers of cases, we must expect most ‘true’ differences to yield 
small critical ratios.” 4 

These results are corroborated by market research experi- 
ence in which letters merely appealing for information as a’ 
personal favor have been more productive than letters offer- 
ing a trivial reward or stressing some alleged advantage to the 
respondent for answering.’ Generally speaking, we) may say 
that the chief resistance to be overcome in mail question- 
naires as in personal interviews is a suspicion that someone is 
trying to sell something. It is well, therefore, to make clear 
at the outset by all possible means that information only is 
sought, and that no tricks of salesmanship are being em- 
ployed. 

Among the important inducements to reply is a stamped 
and addressed return envelope. The convenience of this ar- 
rangement, its favorable psychological effect, the amount of 
the postage, plus perhaps a reluctance to throw away a 
stamped envelope, appear to be among the factors involved. 
The printed “Business Reply Envelope,” on which the post- 
age is paid by the person or organization to whom the en- 
velope is addressed, has been extensively used in recent years. 
Since postage is paid on these envelopes only if they are 
returned, they are assumed to represent a considerable econ- 
omy for organizations sending out many return envelopes 
(although the Business Reply Envelope requires 4 cents). 
Watson found, however, in one market investigation that in- 
quiries accompanied by the old-type stamped envelope se- 

å R. F. Sletto, “Pretesting of Questionnaires,” Amer, Sociol. Rev., 5:193-200, 


1940. 
5R, Watson, op. cit. 
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cured twice as many replies as the Business Reply Envelope.° 
This investigator remarks: “Women apparently hated to see 
that 3-cent stamp go to waste.” If the findings of this study are 
generally valid, the economy of the business reply envelope is 
doubtful. The unmistakable results in the study mentioned 
illustrate also the unsuspected psychological factors that may 
determine whether a questionnaire is returned or not. 

One difficulty with the formulation of an effective appeal 
to accompany questionnaires is that these requests must be 
formulated so as to have general appeal. In an interview the 
approach can be modified to suit the individual case. If the 
questionnaire is to be circulated in a selected group, a knowl- 
edge of the interests, prejudices, and intelligence of that 

‘group will be of great help in formulating an effective ap- 
peal. Where the group is heterogeneous in character, gen- 
eral knowledge of human nature and the experience of other 
similar "ipa gegen must be used as a guide. As in the 
formulation of the questionnaire, the least intelligent, rather 
than the average informant, must be kept in mind. 


(a 
C. THE CONTENT AND STRUCTURE OF THE QUESTIONNAIRE 


All of our remarks in the preceding chapter regarding (1) 
the importance in schedules of having clearly in mind the 
specific questions to be answered, (2) clarity of wording, and 
(3) discretion in the choice and arrangement of items, apply 
with even greater emphasis to questionnaires. The ques- 
tions of explanation and interpretation that almost invari- 
ably arise in an interview must be anticipated and answered 
by the questionnaire itself, preferably by a wording and ar- 
rangement of items so unmistakable as to prevent the ques- 
tions from arising. When this is impossible, detailed and 
unmistakable instructions must accompany the question- 
naire. If possible, such instructions as are necessary should 
be inserted at the place needed rather than at the beginning 
or the end of the questionnaire. 

_ When we have decided upon the specific purposes of the 

inquiry, the first task is to formulate questions calculated to 

yield the desired information. These should be as few as is 

consistent with the ends sought. Beyond this consideration, 
6 Ibid. 
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A MESSAGE FROM THE PRESIDENT OF THE UNITED STATES 


THE WHITE HOUSE 
WASHINGTON 


g . 
TO EVERY WORKER: h 

If you are unemployed or partly unemployed and are able to work and are 
seeking work, please fill out this report card right away and mail it before mid- 
night, Saturday, November 20, 1937. No postage stamp is needed. 

‘The Congress directed me to take thi er It is important to the unem- 
ployed and to everyone in this land tha the census be complete, honest, and 
accurate. If you give me the facts I shall try to use them for the benefit of all. 
who need and want work and do not now have it. E 


febli: Helent- 


UNEMPLOYMENT REPORT CARD 
Please fill out this card if you ate totally or partly unemployed 
Answer all q: 


uestions promptly A 
i portant’ th totally or partly unemployed worker in your household fill out a TE report card. 
Ada tts bmpartant Cast ivr froma your postman Gr post ofico. kmr in onr Aona erines baso questions you need 


it, at any post office or from any postal employee. Do AN 
1. Print full name........ on a farm? meet- 
Print, full oro cr) Geass 
E eine a Couar eae) 
2. Are you: "Gireet tod nomber @ R. F. DD P pn i o s o 
(a) Totally unemployed and want work?.__.-| | r ` 
(b) Partly omployed said want more work? 7. How many hours did you work last week? -zr hours 
(© Working at WPA, NYA, OCO, or other Enter total number of hours worked at ALL (except WPA, 
WOT? saeeemmeenereserrsererensennensn NYA, i er ar ery ort aoe & == tom 
3. Av yon cbls' Geckt eee Pee LED nol] 8. How many woeks did Teinat 
4. Ago ot last birthday.rossesrmeeseseeneeee een TORS i ae aA AAI D WPA, 
8. Color { g NYA, COG, or other emergenc so dag the TS montas 
OF race ESA e a) Negro L_] Other from ‘November 1, 1936, October 1937. 
9, What is your occupation, or kind of work? or ie sie Cai apuia eam 
manent maaana A arg e a ea ae she Geran N T 
Nio demba sgh one RS amaie a ny or employer as an answèr to question 10 
10. Kind of business or industry in which yow 
did, or aro doing, this kind of work?. St rns 
u. workers family li in the same houschold with you?.-...- ———— workers 
Eoy many other workers aro er in, 7o mng wer oh Porn 


12. How many of those workers are: „sm... (a) Totally unemployed and want work? -—-- ——— workers aie 
and want more work?....— Orker } count 


Note—Every totally or pardy (b) Partly employed 
Sept tn ewan oui gate | (o) Working ot WPA, NYA, O00, P Tad 
13. How many persons aro mainly dependent on you for support? ce st wast mumtO-m-=1-1=1=== ————— dependents 
14. What individ: end othor, last week? —-—---eoneneenne-sneenrrnenermnren nnn ————— 
hat was your individual total ingore SEFA OCG, or eer enerenoey wore 
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no rule can be laid down as to maximum, minimum, or op- 
timum numbers of questions. _ The other variables determin- . 
ing whether ornot a questiofinaire will be returned are too 
numerous to enable us to estimate with much reliability the 
degree to which length of the schedule was a determin- 
ing factor. Sletto selected goo" former university students ` 
and divided them into three subgroups, every third name 
from an alphabetical lis ing assigned to each group. He 
describes the results as ws: as 

“The first subgroup was mailed a 10-page questionnaire 
relating to their vocational activities, interests, and needs. 
The second subgroup was mailed a 2 5-page questionnaire re- 
lating to their socio-civic activities, interests, and attitudes. 
The third subgroup was mailed both of these questionnaires 
totaling 35 pages in length. These mimeographed ques- 
tionnaires were of the check-list type, and all were mailed 
on the go day. Utilizing three follow-up requests} the pro- 
portion of returns was 68 percent from the group sent the 10 
pages and 60 percent from that sent the 25 pages. «The 
group sent both questionnaires in the same envelope yielded 
63 percent returns. Since the group sent the 35 pages yielded 
a larger proportion of returns than the group sent the 25 
pages, and only five percent fewer returns than the group sent 
the 10 pages, it was concluded that the factor of length, be- 
tween the limits of 10 and 35 pages, was not likely to affect 
returns to the final questionnaire by more than five to ten 
percent. There is reason to think that the difference between 
the returns to the 10 pages and to the 25 pages was less a 
result of length than of the interest value of the question- 
naire content, since the group sent both questionnaires 
yielded a larger proportion of returns than did the group 
sent the 25-page questionnaire, 

“This first experiment in pretesting suggests that the fac- 
tor of length is less important than it has generally been as- 
sumed to be, insofar as proportion of returns is concerned. 
However, it may be that the effect of length is more pro- 
nounced within the limits of one to ten pages, and that 
within this range a virtual plateau is reached in question- 
naires of the checklist type. If such a situation exists, is 
there some limit beyond which the returns begin to fall off 


` 


: > 
THE QUESTIONNAIRE 189 


sharply when additional pages are added to the question- 
„naire? Would these same results be gained, if the group 
were not one consisting of college-trained people? Future 
research alone can provide us with answers.” 7 
_ Another student of the subject has maple point that it is 
not so much the length of the questionnaire as the number 
of unrelated subjects dealt with that is objectionable.’ If the 
questionnaire is an integrated , and if the questions 
have obvious relationship to each other and to the avowed 
purpose of the inquiry, the mere number of questions will 
seem less important to the person answering. In"short, the 
capacity of the questionnaire to arouse the interest of the 
person to whom it is presented and to cause him to feel that 
this is a “sensible” inquiry on a subject of significance, is 
perhaps of greater importance in determining his willing- 
ness to respond than is the length of the questionnaire. 
Even ff research should reveal that the length of the ques- 
tionnaire, within limits, is a factor of minor importance in 
determining the proportion returned, there are, of course, 
other grounds, such as mere economy of space and labor, for 
keeping the schedule as brief as is consistent with the results 
we seek. Apparent exceptions to this rule would be the in- 
stances in which it is deemed advisable for psychological rea- 
sons to insert a limited number of “waste” questions in order 
to stimulate interest, facilitate the approach to delicate sub- 
jects, or corroborate other questions. For example, a market 
study of people’s preferences in building material as between 
lumber, marble, concrete, etc., might begin with an inquiry 
about the person’s attitude toward the conservation of natu- 
ral resources, in the name of which the original approach 
may have been made, as an appeal to broad social and civic 
interests. Again, a question about the changing purchasing 
power of the dollar may serve as a disarming opening in a 
questionnaire designed really to determine present expendi- 
ture habits. An investigation of the use of various methods 
of contraception began with the question “Would you ap- 
prove of straight-forward, tactful, inoffensive advertising of 
3R Seem, yg el of a CANE. Market Research, 
May, 1936, pp. 17-19- r 
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this type of product?” “The question served to draw into 
a discussion of birth control the very women who were op- 
posed to such discussion, and who would otherwise have been 
lost from the sampling. Such women, having given a nega- 
tive response, weihen asked how they thought such infor- 
mation should be made available, if not through advertising. 
In most cases, anxious to defend their stand, they were will- 
ing to answer and to proceed with the remainder of the in- 
terview.”® While this case involved replies to questions in 
an interview rather than in a questionnaire, the principle of 
the technique is the same. 

Even if no “waste” questions, (which need not be tabu- 
lated) are deliberately included, it is advisable to begin the 
questionnaire with inquiries that are unobjectionable, fa- 
miliar, and easily answered. The routine items of sex, na- 
tionality, length of residence in present place, etc., satisfy 
these requirements. If the first questions appear easy, a 
person may get started answering and, if so, an important 
objective has been achieved. i 

The arrangement of the questions in the body of the 
schedule is also a matter of some importance. A question 
which is difficult, objectionable, or sure to arouse antago- 
nism at the beginning of a questionnaire may be answered 
without any hesitation after an adequate foundation has 
been laid for it and after the person answering has “warmed 
up.” The reinforcing effect of wishing to finish something 
that has been started can be counted on to secure answers 
to questions in the last part of the questionnaire which, if 
asked at the outset, would probably have caused refusal to 
return the schedule. The whole psychology of questioning 
1s very intricate. Certain aspects of the technique have been 
very highly developed by the legal profession. Many other 


aspects and types of interviewing remain to be studied by 
research workers. 


D. WORDING OF QUESTIONS 


The order of the questions is very intimately related to 
their wording, and vice versa. A question that is perfectly 


9 J. W, Riley and Matilda White, “The Use of Various Methods of Contra- 
ception,” Amer. Sociol. Rev.,»5:890-903, 1940, p- 8g. 
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clear if it follows several other questions might be unintel- 
ligible if asked by itself. That is, the wording necessarily 
depends in part upon the order in which the questions are 
asked. What we said in the preceding chapter regarding the 
wording of schedules in general also appliés with special em- 
phasis to the wording of questionnaires. The illustrations 
given in Chapter VI will also serve to emphasize some elemen- 
tary considerations in the present connection. The matter 
will come up again in the next chapter in connection with 
the measurement of attitudes. We shall confine ourselves, 
therefore, at present, to a few special cautions in the wording 
of questionnaires. 

Bowley’s general rules on this point constitute a good 
summary of the ends to be aimed at, although they do not 
tell us much about how to achieve these results. According 
to this authority, questions should have the following char- 
acteristics : 

. Comparatively few in number 

. Require an answer of a number or of a “yes” or “no” 
. Simple sa to be readily understood 

. Such as will be answered without bias 

. Not unnecessarily inquisitorial 


. As far as possible corroboratory by É 
. Such as directly and unmistakably cover the point of in- 


formation desired.1° 

The education and general intellectual level of the public 
from which answers are desired will determine to a large ex- 
tent what degree of simplicity (i.e., in vocabulary, sentence 
structure, etc.) is to be employed. The only final guide on 
this point, as indeed on most others, is adequate pretesting 
of the questionnaire on sample populations, and the revision 
of the schedule on this basis. In the preparation of the pre- 
liminary forms for such testing, the advice of persons inti- 
mately familiar with the group to be studied is much to be 
desired. Bowley’s rules may be supplemented by several spe- 
cific observations on problems that always arise. 

1. Categorized versus “free” answers. Is it desirable to 
list, as far as can be imagined, all possible answers in order 
to permit replies exclusively by a check, mark or by yes or 

10 A L. Bowley, Elements of Statistics, (Fourth edition, King, London, 1920), 
pp- 18-25. 
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no? This is one of the most difficult problems in the for- 
mulation of questionnaires. Unless the answers sought are 
sharply and obviously exclusive, and exhaustive, the listing 
of a variety of possible responses frequently results in the 
checking of several answers, so that the result may be diffi- 
cult to interpret. For example, to inquire regarding marital 
status by requesting a check mark after one of the categories, 
“married,” “single,” “divorced,” “divorced and remarried,” 
“separated,” “widowed,” is likely to yield satisfactory results 
because the categories are exhaustive and mutually exclusive. 
Consider, on the other hand, the following: 


Where did you first meet your college roommate : 

(a) At preparatory school? . . . 

is) In the same community? ... 

c) In the same neighborhood? . . . 

(d) Other. . . , 
Obviously the first three of these might be checked. If an 
attempt is made to list all possible answers in a case of this 
kind, an absurdly long list would have to be appended. If 
this practice is adopted in questions of attitude we may sug- 
gest to the person receiving the questionnaire many answers 
which he probably otherwise would never have thought of. 
In some cases this may be desirable, e.g., when we are study- 
ing how people will respond when exposed to certain sugges- 
tions. But in many questionnaire inquiries we are probably 
interested in the person’s own formulation of his response. 
If we allow him to express this response in his own words, 
difficulties of interpreting his answers are sure to arise. Con- 
versely, if we limit him to our own list of answers we force 
his response into our prearranged categories. 

Most practical inquiries, in the face of this dilemma, adopt 
a compromise. All questionnaires are necessarily formu- 
lated on the basis of some knowledge of the type of answer 
probable. Such knowledge may be secured from experi- 
mental questionnaires permitting free response to questions, 
without any attempt to categorize the answers in advance. 
The more intimately the investigator knows his public, the 
more effectively he can formulate questions and the catego- 
ries into which most answers will fall. Hence a careful scru- 
tiny of all available previous attempts to study the subject 

A 
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under Consideration should» precede the formulation of a 
schedule. Adequate pretesting of the questionnaire, should 
yield most of the knowledge needed for a formulation of an- 
swer categories which will do a minimum of violence to the 
spontaneous reaction of the respondent. The easy way in 
constructing the questionnaire is to ask merely the questions 
and allow the respondents complete freedom as to the words 
and categories in which they wish to answer. The classifica- 
tion of these responses after the schedules come in then be- 
comes the problem. Except in cases where the possible an- 
swers are mutually exclusive and exhaustive, a compromise 
is usually adopted. The most likely and frequent answers 
are listed for check, with a category, “others,” concluding the 
list of possible answers. The problem of classifying the re- 
sponses received under the omnibus category is then the 
same as mentioned above. But if the responses in this cate- 
gory are few and not too varied, the proportions of the prob- 
lem are greatly reduced as compared with the situation when 
no attempt is made to categorize the answers. 

2. “Leading” questions. Closely related with the problem 
of categorizing possible answers is the problem of “leading” 
questions in general. These are questions so phrased as to 
suggest a certain answer. The salesman in a store approaches 
the customer with, “What will you have?” rather than, “Will 
you have something?” He asks, “How many?” instead of 
“Do you want more than one?” These may seem trivial 
illustrations, but the phrasings suggest an answer that the 
questioner desires. Susceptibility to this type of suggestion 
naturally varies among different people... Sometimes eti- 
quette itself seems to dictate the propriety of giving people 
the answers they desire. Thus, Stefansson reports that the 
Eskimos tend to give encouraging reports about lost explor- 
ers because of the obvious desire of searching parties to se- 
cure such reports? The Chinese are correspondingly po- 


11 V, Stefansson, Unsolved Mysteries of the Arctic, (Macmillan, 1939): “If 
an investigator said to a primitive Eskimo that surely the Franklin Lat 
must have killed a lot of caribou, the answer — irrespective of fact — woul 
be that they certainly did, that they killed an awful lot.” (p. 118) 

Leading questions may, of course, have value if they are asked deliberately 
and the results are treated accordingly. For example, if affirmative replies 
are secured even after strong counter-suggestion by the form of the question, 
this may be a valuable measure of the firmness of a belief or of the resistance 
to be overcome. CRR. Franzen, op. cit. 
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lite12 Suggestiveness in response to leading questions may 
vary considerably with age, education, and different cultures. 
An example of the same phenomenon is found in the “yes” 
tendency in public opinion polls. For example, it is better 
to ask, “Is it desirable or undesirable to balance the budget,” 
than to mention only one of the alternatives. 

Leading questions are most likely to be employed, some- 
times quite inadvertently, when the investigator is eager to 
have his results prove rather than disprove some cherished 
thesis. If an investigator of the causes of application for re- 
lief should say to a laborer, “I suppose you were forced to 
seek relief because of the depression ?” or, “Unemployment 
is inevitable under capitalism, isn’t it?” the answers would 
be likely to show a high percentage of affirmative responses, 
especially if it is assumed that the questioner may have influ- 
ence in the granting of relief. Such flagrant cases probably 
do not occur in any study purporting to yield unbiased re- 
sults. But the principle involved frequently crops up in 
subtle ways because of the emotional aura attached to cer- 
tain words. We shall return to this subject in the next chap- 
ter. More refined methods of determining gradations of 
intensity or firmness of beliefs, as well as the undesirability 
of hypothetical questions, and questions hedged about with 
too many reservations, will also be considered in that chap- 
ter. In the meantime we may summarize here some of the 
results of actual research into this important field. 

3- The effect of stereotyped words. We have already re- 
ferred (p. 66) to the biasing influence of attaching certain 
words, such as “Red,” “radical,” “Fascist,” “Communist,” to 
questions regarding policy or practice. Thus Menefee found 
that certain propositions were accepted when not character- 
ized by these labels but were rejected when these labels were 
attached.'* Turner? warns against the emotional connota- 


12 Carl Crow, Four Hundred Million Customers, (Harper, 1987), pp- 132-83; 


18S, Menefee, “The Effect of Stereot Words on Politica P gments, 
Amer. Sociol. Rev., 1:614-21, 1996. biasing influence of the word 
“Fascism” at present in the United States has also studied by E. Raskin 


and $, W. Cook ("A Further Investigation of the Measurement of an Attitude 
toward Fascism,” Four. Social Psychol., 9:201-06, 1938) with similar results. 
See also R. Stagner, “The Prestige Value of Different Types of Leadership,” 
Sociol. and Soc. Research, 25:403-19, 1941- 

UR. C. Turner, “A Public Opinion Poll on Educational and Other Issues,” 
Michigan Public Opinion Surveys, 1939, p- 49- 
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tions of capitalistic, save, guarantee, rich, and charity. The 
Institute for Propaganda Analysis’ notes a number of words 
as examples of “name-calling” rather than designations of 
objective content. Among them are such words as alien, 
demagog, dictator, economic royalist, miuckraker. 

The names of political parties** and political personages, 
when associated with certain inquiries, also bias the re- 
sponse. If reliable attitudes on issues alone, aside from par- 
ties or personages, are desired, the names of the latter should 
be avoided. Thus the question, “Do you approve of the idea 
of having Thanksgiving a week earlier this year?” received _ 
a less favorable response than when the question was worded, 
“Do you approve of President Roosevelt’s idea of having 
Thanksgiving a week earlier this year 2717 Likewise, Stu- 
denski showed that the wording “spending” as applied to 
governmental finance arouses antagonistic responses not stim- 
ulated when the question is rephrased so as to omit this 
word.1® 

4. Ambiguous words. The different meanings that words 
have for different people is a source of constant and subtle 
errors in the interpretation of questionnaire results. The 
word “kinds” as in, “What kinds of powder do you use?” is 
sometimes taken to mean what brands; sometimes the an- 
swer will be in terms of form (loose or compact) ; and some- 
times in terms of color2® Robinson found that the word 
“publicly” has different meanings to different people in such 
a question as, “Is the electric utility in your community pri- 
vately or publicly owned?” Some responded that it was 
publicly owned because the public owned the shares. 

The extensive interest in public opinion polls has greatly 
stimulated attention to the wording of questions and the 
results of systematic analysis of future questionnaire studies 


15 Propaganda Analysis, 1, No. 1, Nov. 1937- 1 5 

16 A. B. Blankenship, “The Choice of Words in Poll Questions,” Sociol. and 
Soc. Research, 25:12-18, 1940. g fi 

17 Ibid. See also E. Roper, “Wording Questions for the Polls,” Public 
Opinion Quarterly, 4:129-30, 1949. s, 

18 P, Studenski, “How Polls Can Mislead,” Harper’s Magazine, 180:80-83, 
1939- 

is A. B. Blankenship, op. cit. 

20C. S. Robinson, “The New Science of Public Opinion Measurement and 
its Implications for Business,” Harvard Business chool Alumni Bulletin, 


1939; PP- 3-8. i 
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will doubtless make available much reliable knowledge at 
which we can now only guess from scattered studies of the 
kind here mentioned. A useful guide to vocabulary is E. L. 
Thorndike’s A Teacher's Word Book of 20,000 Words." It 
treats the level of difficulty of different words as indicated 
by the frequency of use in common sources of literature. 
Only adequate pretesting of the questionnaire with definite 
checks will reveal to what extent the vocabulary used means 
the same to the public which we propose to investigate. 


E. PHYSICAL AND ESTHETIC ASPECTS 


The considerations noted in the preceding chapter (p. 170) 
with respect to the physical aspects of schedules in general 
apply also to questionnaires. We have also noted above cer- 
tain considerations regarding the stationery and printing of 
the introductory letter. Sletto’s report on several experi- 
ments on the effects of physical and esthetic factors in ques- 
tionnaires is illuminating both as an indication of the kind 
of items to be considered and as a report on one of the first 
inquiries of its kind. 

“Several pretesting experiments were conducted to gain 
knowledge applicable to enhancing the attractiveness and 
interest value of the questionnaire. The knowledge acquired 
in these experiments was applied to select an esthetically 
pleasing cover design for the questionnaire, to find a title 
which would arouse interest, to choose an attractive page 
format, to determine upon a size and style of type that would 
make the final printed questionnaire easily readable under 
poor illumination and by people with less than normal vision, 
and to find photographs of high interest value and appropti- 
ateness to illustrate the various sections of the questionnaire. 

“One of these experimental studies of esthetic preferences 
will be cited as representative of this type of pretesting. The 
purpose of this experiment was to select a page format that 
would be esthetically pleasing and to ascertain the extent to 
which esthetic preferences, as they relate to design of ques- 
tionnaires, are,similar from person to person and for similar 
groups of persons. The spacing of photograph, page title 
and text, and the character of the decorative lines used to 

21 Columbia University Press, 1931. i 
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integrate these elements of the page, are factors affecting the 
esthetic impression produced by a page design. 

“Ten different page formats, prepared by an artist who 
had special qualifications for this work, were submitted to 
one university class containing 23 students and to another 
containing 24. Art students were avoided in this experiment 
on the assumption that they were likely to differ more from 
a cross-section of the former university population than do 
students in other university classes. Each person in these 
two classes was requested to select the page he liked best and 
the one he liked least among the ten submitted. The same 
photograph, title, and text were found on each page in order 
to control these factors. The spacing of these elements and 
the decorative lines employed were the only variables, from 
page to page. ; j 

“Students in this experiment were cautioned not to com- 
municate with each other until after their choices had been 
recorded on slips of paper and collected. When these choices 
were tabulated, it was found that two of the'ten designs were 
much superior to the others, since they received 28 of the 47 
votes as most preferred and only three of the 47 negative 
votes as least preferred. The two least liked designs were 
accorded 32 of the 47 negative votes and none of the positive 
votes. The remaining 19 positive votes and 12 negative votes 
were distributed among the other six designs. When these 
10 designs were ranked according to the number of positive 
votes obtained in each group, and these ranks were corre- 
lated for the two groups, a rank order coefficient of .74 was 
found. When the designs were ranked according to the num- 
ber of negative votes, the rank order correlation coefficient 
was .yo. This result indicates that aesthetic preferences are 
not highly individualistic and erratic, but rather that a strong 
consensus of preferences exists within homogeneous social 


groups.” 7 
Fs PRETESTING OF QUESTIONNAIRES 


The final revision of a questionnaire should be made on 
the basis of actual test on a public as nearly as possible like 
the one to be investigated. Not even experienced investi- 


22R. F. Sletto, op. 4. pp- 198-99- 
v 
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gators thoroughly familiar with the subject, the fiéld, and 
the technique of questionnaire construction can foresee all 
of the difficulties of interpretation and response that the 
recipients of the schedule will bring to his attention. The 
inexperienced researcher is likely to be impatient with this 
preliminary work, which may seem like hair-splitting over 
the meaning of words, and other details. But patience and 
care in this preliminary work may make all the difference 
between success and failure, both in the cooperation of the 
respondents and in the reliability and validity of the results. 
Sletto reports that more than six months were spent in the 
construction of the questionnaire described above, every page 
being revised at least five times before reaching final form.** 
Ina a see a (Chap. IX) we shall give a more detailed 
account of the construction and pretesting of a scoring sched- 
ule. Although this type of schedule differs in important 
respects from the questionnaire, many of the principles gov- 
erning the construction of both types are the same. 

The general ptrpose of pretesting a schedule is to deter- 
mine its reliability and validity. This is important whether 
It is possible to improve the schedule for actual use or not, 
as this knowledge forms the basis for our estimate of the de- 
gree of significance to be attached to the results. We have 
already touched briefly on methods of estimating the relia- 
bility of a sample in our chapter on Sampling (Chap. V, 
p- 147). We shall return to the subject of statistical reliabil- 
ity and errors again in the next chapter. This subject is, 
however, a major concern of the field of statistics and the 
student will need to be familiar with the elements of proba- 
bility theory for a real appreciation and discriminating use 
of the various tests of reliability. The present treatment, 


therefore, aims only to call attention to some of the problems 
involved. 


G. RELIABILITY 


__A schedule, questionnaire, or other instrument is reliable 

if it gives the same or highly similar results when reapplied 

to the same or highly similar phenomena. Two census takers 

canvassing the same community on the same day with the 
33 Ibid., p. 199. 4 
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schedule shown in Fig. 2 should bring in the same infor- 
mation. If they do not bring in the same results, the dis- 
crepancy must be due to (1) variation in the interviewers’ 
interpretations of the questions; (2) variation in the record- 
ing of the answers; or (3) variation in the informants’ an- 
swers on the two occasions, such as might occur, for example, 
from the fact that each interviewer interviewed different 
members of, the family. Im the case of a questionnaire, we 
would call it unreliable if it failed to bring in the same in- 
formation when repeatedly submitted to the same person, 
assuming that it called for facts which had not changed be- 
tween the two occasions. In actual studies involving num- 
bers of cases we consider the questionnaire reliable if the 
percentage answering a certain way or the mean the dis- 
tribution and its standard deviation remain fairly stable 
when the questionnaire is resubmitted to the same popula- 
tion or to another population we believe, from other evi- 
dence, to be highly similar. Thus it was found that ques- 
tionnaires on regularity of radio listening toleertain programs, 
if repeated within 15 days, gave results varying by a maxi- 
mum of 8 percent.** Another research organization re-inter- 
viewed 10,000 people in rural areas and found that the re- 
ported average daily listening for the whole group differed 
by only five minutes in the two studies.*° This is not to say 
that there may not have been great discrepancies in individ- 
ual reports. The study merely shows that sampled informa- 
tion on listening habits was reliable in given cases, if taken 
for a group as a whole. 

The reliability of questionnaire responses will vary greatly 
according to subject-matter, the public questioned, and the 
time elapsing between questionnaires. Responses for an en- 
tire group will usually be more stable (reliable) than indi- 
vidual responses. Bain ** found that college freshmen changed 
their replies on nearly one-fourth of 3050 items after a pe- 
riod of two and a half months, although they were urged to 


24 Alberta Curtis, “The Reliability of a Report on Listening Habits,” Jour. 
Applied Psychol., 23:127-30, 1939- also, “Checking the Checkers, Adver- 
tising and Selling, Dec. 19, 1935: Also “Dependability of Psychological Brand 
Barometers,” Jour. Applied hol., 22:1-7, 1938. 

35 Alberta Curtis, op. cit., p. 129. r 

26 Read Bain, “Stability in Questionnaire Response,” Amer. Four. Sociol, 
87:445-53. 1931- P3 


G 


200 SOCIAL RESEARCH | 


answer the questions the same way as nearly as possible. The 
questions covered factual family data, factual personal data, 
and subjective personal data. Girls showed uniformly greater 
stability of response than boys, and the whole group showed 
greater stability on factual personal data than on other types 
of response. These results were confirmed in a highly simi- 
lar experiment by Smith?” A somewhat similar study by 
Cavan * on eighth-grade childrén at an interval of only one 
week showed changes in response on only about 13 percent 
of the items. The individual stability of questionnaire re- 
plies is probably one which has to be estimated in each case 
rather than according to any general rule, on account of the 
large number of variables likely to be operating. 

te from any fault of the schedule, we know that there 
will be some variation in the results on account of the opera- 
tion of “pure chance,” i.e., multiplicity of small causes which 
tend to balance each other in a large number of cases, ¢.g:5 
mere mistakes in speech, reading, marking the schedule, etc. 
The probability'that an observed difference in the obtained 
results is due to these chance “accidents,” however, can be 
estimated from the known laws of probability that have been — 
developed from observations of how chance phenomena dis- 
tribute themselves in frequency, as, for example, the distribu- 
tion of heads and tails when coins are tossed á&'large number 
of times. On this basis we can estimate what the probability 
is that the observed difference in Its of two or more: ` 
trials of a questionnaire is due to chance. If the difference 
secured is so small that the likelihood is high that it would |` 
tend to disappear as the number of questionnaires increased, 
we attribute the discrepancy to “sampling error,” measured 
by the formula given on p- 147. If, however, the discrep- 
ancy between the results of two trials of the questionnaire is 
so large that, by the laws of probability, the likelihood is 
small that such a difference could be due to mere chance 
Variation, we must look for the reason for the variation in 
some ambiguity in the questionnaire. In the case of sched- 
ules filled in by an interviewer, there is, of curse, the addi- 


3 M. Smith, “A Note on Stability i t i r . Jour. 
UE evista, vane n ility in Questionnaire Response,” Amer. F 


38 Ruth S. Cafan, “The Questionnaire i iologi ject,” 
Amer, Jour, Sociol. eater — re in a Sociological Research Proj 
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tional possibility of consistent and persistent misinterpreta- 

tion of a question or a persistent misrecording of the answers 

by one or several interviewers. By giving a questionnaire to 

a trial group on different occasions and comparing the re- 
sults, the less reliable questions, i.e., those showing a lack of 
stability in their responses, can be identified and modified. 

~ Corroboratory questions, e.g., asking both “Age” and “Date 
of Birth,” may also be used toa certain extent as a check on 
reliability. å i 


H. VALIDITY 


The validity of the results of a questionnaire refers to the 

extent to which these results correspond to some criterion 

` which we regard as sufficiently valid to accept the agreement 
of the questionnaire results with it as a test of the validity of 
= the questionnaire results. In some cases such criteria of a 
final sort are provided by subsequent events. That is, when 
the election results are predicted by a questionnaire, the 
questionnaire is validated by the results of the election. 
More frequently the results of questionnaires are considered 
s valid on the basis of the dangerous criterion that the results 
correspond with “commonsense” or with what seems “rea- 
sonable.” In the absence of other criteria, and always in 

. addition, o. them, commonsense is entitled to consideration. 
But in view of the fact that systematic questionnaire studies 

a are undertaken be we do not trust the commonsense 
> estimatés, we can hardly use the latter as sole criteria of the 

-* validity’of the former. More justifiably the judgment of ex- 

_ ‘perts is relied on. Again, the results of the questionnaire 
- may be compared with results on the same questions obtained 

by other methods thought likely to yield more valid re- 
sponses than the questionnaire, such as oral interviews. When 
the questionnaire asks for certain types of facts, individual 
check is, sometimes possible from’ reliable public records 
such as birth certificates, real estate titles, etc. 

It is probably true that the estimated validity of question- 
naire results as frequently presumptive and inferential 
+ rather than direct, “We assume high validity when we have 

. guarded against. the known possibilities of error, when there 

is no known reason why informants should Mntentionally 
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mislead, and when the results are not inconsistent with other 
known facts about the respondents or with other measures, 
the validity of which is accepted. We shall return to the 
subject of validity in a later chapter in connection with meas- 
uring scales. In the end, the validity of all instruments will 
be determined by their usefulness in serving the discrimina- 
tory purposes for which they were invented. These purposes 
are likely sooner or later to involve practical adjustments of 
a kind which yield a positive (i.e., generally accepted) answer 
to the question of validity of our instruments.” 


I, SENDING OUT THE QUESTIONNAIRE 


When all the above precautions have been taken with 
respect to the letter of solicitation, the content, structure, 
wording and form of the questionnaire, and when it has 
been pretested and revised on the basis of revealed weak- 
nesses, we are ready to send it out to the population selected 
for the inquiry. That population will usually be a sample, 
and if so, all of the principles of valid sampling discussed in 
the preceding chapter apply at this point. 

It is usually desirable to send out all of the questionnaires 
at once, or so that they will be received about the same time. 
Otherwise some recipient may become suspicious that he has 
been singled out for special inquiry. It also makes it easier 
to conduct the follow-up work and tends to get the returns 
in at approximately the same time. As to the time of the 
week when a questionnaire should arrive in order to stand 
the best chance of being filled out, Toops advances the the- 
ory, based on some data, that questionnaires tend to be filled 
out during week-ends. “Accordingly, the questionnaire 
should not arrive at the recipient's home so early in the 
week that he will lay it aside and forget it, or throw it in the 
waste-basket for lack of time to answer it immediately, but 

E] . i 
oss Saves nate meee meters me 
plumbing, automobile radiators, etc., always ensue. The thtory of the expan 
sion and contraction of matter with changes in tempetature is merely the 


verbal rationale and lization of the practical experience noted. For 


further discussion of the validity of measu instruments, Founda- 
tions of Sociology, (Macmillan, 1939), pp. au id 
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ee. 
rather should arrive about Friday night or Saturday morn- 
ing, whereupon it may receive immediate attention.” * 


J. FOLLOW-UP LETTERS 


The time to send follow-up requests will usually be deter- 
mined by the rate at which returns are coming in. When 
returns show a distinct falling off, some kind of a follow-up 
should be brought to bear upon those who have not yet re- 
plied. This may be in the form of a letter together with the 
original schedule, a double postcard * with only the more 
important questions, a telephone call, or an interview. If 
the telephone or interviews are used, it may be sufficient to 
take only a random sample of the non-respondents. This 
involves all the problems of valid sampling, however, and, 
in the case of telephone interviews, the biasing influence 
that telephones are associated with economic status. 

How many follow-up efforts are desirable is, of course, a 
matter of how important a larger percentage of replies is 
considered from the standpoint of the representative nature 
of the sample secured. This can be determined by a tabu- 
lation of the results of some of the replies already received. 
“One should compute daily the cumulative averages or per- 
centages on the four or five most significant and essential 
questions of the questionnaire. When these significant aver- 
ages and percentages vary but little from week to week, one 
has some reasonable assurance that the follow-up has pro- 
ceeded about far enough.” * Toops received replies from 
16 out of a remaining 24 as a result of his fifth follow-up let- 
ter to former college students. Six follow-up letters se- 
cured 100 percent returns on 110 questionnaires to college 
professors and administrators.** 

3H. A. Toops, “Validating the Questionnaire Method,” Jour. Personnel 
Research, 2:157-58, 1923+ 

31 These are regular postcards, clipped together, one to be torn off and re- 
turned, Sletto (op. cit.) found that posteards yield the same frequency of 
response as letters when used as follow-up stimuli. Ț 7 k 

Herbert’ A. Toops, “Validating the estionnaire Method,” op. cit, pp- 
153-69. This article also contains suggestive copies. of types of follow-up 


“H. A. com. “The Returns from Follow-Up Letters to Questionnaires,” 
Four. Applied Psychol., 10:92-101, 1926. i 
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e . 
Stanton? sé€ured an increase of 75 percent (from 28.3 
percent to 50.2 percent) through his first follow-up to a ques- 
tionnaire to teachers on the use of radio equipment in the 


classroom. The same author also reports a g4 percent return ' 
on a nine-page discussion and questionnaire\as a result Of - ; 


pretesting the questionnaire and, three fulltength follow- 
ups** (one when g4 percent of the replies ‘ad been received, 
the second when 72 percent, and the third when. 8g"percent 
had been returned). tn i BS ae 
~~ Suchman and McGandless 37 report the ‘results of the fol- 


v 


lected from the telephone directory ofyCedar Rapids. and 
Iowa City. A questionnaire was sent to ach Woman. When 
not more than one return per day was received for five suc- 
cessive days, a second wave of the same’ questionnaire was 
sent to all who had not answered. After a similar period of 
waiting, a random sample of every second person of ‘those 
who had not answered was called on the telephone for their 
answer., This sample was considered representative of all 


lowing technique : x stipe list of 606 awomen was se- ` 
p 


those who had not returned the questionnaires. ise 
The results are indicated in Table II. j 
TABLE II38 
Proportion of Replies to Successive Waves of a Questionnaire on 
vt, Radio Programs een 


Percent replies 


First wave (m . 16.8 
Second wave (mail) 34.1 
Residual sample (telephone) 97.2 


The same authors report the following experience with a 
random group of goo subscribers {selected from a list: of 
10,000) to a booklet listing the music broadcasts of a brodd- 
casting station. The restilts are given in Table III. 


2 
Frank Stanton, “Notes on the Validity of } Quettionna ” 
Pr- Applied Psychol., 23:95-104, inn H 10m Fry a 


"E. « Suchman and B. McCandless, 
Jour. ied Psychol., 24:758-69, 1940. 


e 
“Who Answers Questionnaires ?” 
38 Ibid., D. 759. 
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TABLE III* 


Proportion of Replies to Successive Waves of a Questionnaire 
on Radio Broadcasts i 


“f 
hf Questionnaires 


ssent 


b 

ginal goo were eliminated from the sample on account 
vailability. See op. cit., p. 762, footnote, 

* " 


ave 4° = 

‘The amount of follow-up necessary in any given inquiry 
will vary.greatly with such factors as (1) the mandatory power, 
prestige, and personal relations of the investigator to the 
public investigated, (2) the simplicity of the questionnaire, 
($) the educational level of the group solicited, and above all 
with (4) the interest of the public solicited in the subject 
under investigation. (5) Time and resources of the investi- 
gator and the degree of certainty needed in the particular 


inquiry will also determine how far to carry the follow-up. 
; K. WHO ANSWERS QUESTIONNAIRES ? a 


When a questionnaire has been sent out, a certain number 
will be answered quite soon, others will trail in for a period 
of some months, and some will not be returned at all. One 
question of great significance from the standpoint of the- 
value of the conclusions is: Are the questionnaires returned 
representative of the results that would have been secured if 
all had been returned? 7 

The purpose of many of the suggestions in: this chapter 
has been to secure a high percentage of returns. The experi- 
ence we haye reviewed in the preceding section shows that 

m roid, pay ‘ 

For a tabulation of how one commercial questionnaire to teachers was 
returned in terms Of percentages at five-day intervals over a period of 122 
days, see F. Stanton, “Validity of Questionnaire Returns,” Jour. At pled Psy- 
chol., 23:98, 1939. See also, E. A. Suchman and B. McCandless, “V An- 
swers Questionnaires ?” Four. Applied Psychol., 24:758-69, 1940. 
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people vary in the amount of stimuli required to cause them 
to respond to questionnaires. The results actually achieved 
in different studies vary so widely and are conditioned upon 
so many variables as to make any generalization of very dubi- 
ous value. We refer here to questionnaires which the re- 
cipient may ignore without penalty, and not official inquiries 
like the income tax blanks, or other questionnaires with co- 
ercive powers behind them. Most ordinary studies as con- 
ducted by private and relatively unskilled persons yield only 
from 5 to 20 percent of returns.*4 Certainly no reliable gen- 
eralization can be based on such returns, because, as we shall 
see below, the sample is practically certain to be biased. 
That is, people who do not answer are likely to be different 
from those who do, also in other respects relevant to the in- 
quiry. Even when a very high percentage of returns is se- 
cured, this bias is likely to be present. For example, Sletto * 
secured 6g percent returns from 1381 former university stu- 
dents on a 52-page questionnaire which had been very care- 
fully prepared and pretested. Even in this large sample, it 
was found that 75 percent of those who failed to return the 
questionnaire were non-graduates, as contrasted with 60 per- 
cent non-graduates in the whole group. This suggests that 
even when the proportion of returns is high, biased results 
are quite possible. In short, a high percentage of returns is 
not in itself a guarantee of adequacy. As we have noted in 
Chapter V (p. 151), public opinion polls have brought to 
light some striking data regarding the selective workings of 
questionnaires in this field.43 

From the experience of competent and large-scale users of 
questionnaires we may summarize the present state of knowl- 
edge on the subject of the selectivity of returns as follows: 

1. Non-readers and non-writers are automatically excluded 
from participation. How seriously this may affect a study 
depends, of course, upon the public it is desired to. reach. 
It must be remembered, however, that for purposes of an- 

4F. Stanton, op. cit., p. 102. D 


4R, F. Sletto, op. cit. ; see also, R. C. Pace, “F. i i 
3 , Op. cit. ; » R. C. . “Factors Infltencin: uestion- 
naire Returns from Former University Students,” Jour. Applied Eek, 23: 


+ 1939. 
43 % : : i ; 
we imo Roper, Sampling Public Opinion,” Jour. Amer. Statis. Asso., 35: 
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swering questionnaires the census definition of illiteracy is 
much too simple. Perhaps 25 percent of the general adult 
population of the United States is illiterate for questionnaire 
purposes.** 

2. Interest in, or familiarity with, the topic or product 
under investigation is’ Unquestionably a major factor in de- 
termining the return of a questionnaire (Stanton,*® Watson,*° 
Suchman and McCandless*"). Non-users of a product an- 
swer in much smaller proportions, whether from a lack of 
interest or a reluctance to admit non-use (Salisbury **). This 
effect has been noticed especially in connection with per- 
sonal and, luxury products.*? 

3. The better educated are, generally speaking, more 
likely to return questionnaires than the less educated (Such- 
man and McCandless *°). 

4. Articulateness (measured by number of words written 
in response to “free answer” questions) is correlated to some 
extent with intellectual training but not with characteristics 
such as age, income level, size of community, marital status, 
or occupation of husband. Differences in articulateness are 
associated with differences in interests, which may themselves 
be the result of differences in amount of education. For ex- 
ample, a study of preferences in radio programs showed that 
the less articulate prefer needlework, dramatic programs, 
while the more articulate prefer reading and classical music.™ 

g. Follow-up procedures are important in decreasing the 
bias of the returns from the original request, because of the 
selective influences noted above. Follow-up procedures are 
therefore preferable to a mere increase in the number of 
original questionnaires in order to secure a return of a cer- 
tain size, since the larger number would be likely to have 
ye See H. R. Huse, The Illiteracy of the Literate, (Appleton-Century, 1933)» 

8 Op. cit. 

a hrr fnvestigations by Mail,” Market Research, 7:11-17, 1987- 

48 Philip Salisbury, “Eighteen Elements of Danger in Making Mail Surveys,” 
Sales Management, 34:28, 30, 84, 85, Feb. 15, 1938. 

49 Ibid. : 

50 Op. cit. i 

51 Isabelle Wagner, “Articulate and Inarticulate Replies to Questionnaires,” 
Jour. Applied Psychol., 238:104-15, 1939- 
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r 
the same biases as the smaller number (Suchman and Mc- 
Candless,® Rollins 5). 

6. The supposed greater economy of questionnaire studies 
as compared with interviews is seriously questioned by some 
. authorities, if the slowness of securing returns (at least three 
weeks) and the cost per return is considered.™ * 


L. CONCLUSION 


Experience with mail questionnaires has shown that care- 
lessly prepared schedules distributed to the general public 
yield from 5 to 20 percent returns. We have shown also that 
on account of the variation in the tendency to respond to 
questionnaires, generalizations from such results are practi- 
cally worthless. In fact, they may be worse than worthless, 
because they frequently create the impression among the un- 
wary that the subject has been studied “scientifically” and 
that the results have a validity which they clearly do not 
have. Such results are actually of little more consequence 
than those of the traveler and journalist who makes a trip 
abroad or from coast to coast talking with “lots-of people,” 
a taxi driver here, a grocer there, and usually finding that 
“public opinion” is practically unanimously in favor of his 
own viewpoint. 

On the other hand, we have shown that when a question- 
naire is carefully prepared according to the principles we 
have discussed, it is an instrument capable of yielding reli- 


able results. The principal points to which we have directed 
attention in this connection are ; 


(1) A skillful appeal for cooperation. 
(2) The enclosure of a stam envelope for reply. 
(3) Care in the wording and structure of the schedule. 
is Its esthetic appeal. 
(5) Adequate pretesting for wil 
no St 
« G. Rollins, “A i j JOM: 
Applied Prychole 477098. 190 a a 2 
Frank Stanton, op. cit., says, “When one considers the necessary wait that 
is a part of mail surveys —and the cost per return in addition to all other 
shortcomings — the advantages of the personal-interview technique far out- 


weigh whatever advantages may be cited , 
shoul He Prapa, pA y be cited for the mail procedure” (p. 99) h 


rely on interviewing rather than on mail questionnaires. 
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(a) its capacity to evoke response, 
b) reliability, and u 
c) validity. 
e) Revision upon the basis of these tests. 
7) Sufficient follow-up effort to insure the representativeness 
of the replies received. 


F+ 

We have been concerned in the present chapter with mail 
questionnaires asking for information of any kind, from per- 
sonal factual data, such as age and sex, to questions about 
beliefs, opinions, and attitudes. In the next chapter we 
shall consider in greater detail some instruments for secur- 
ing the latter type of information. This will involve, in 
part, a more intensive consideration of a subject on which 
we have already touched, namely, the wording ot inquiries 
or other forms of verbal stimuli to which we wish to secure 
responses.» We shall be primarily concerned, however, with 
questions of measurement of attitudes through specially 
constructed tests and scales, by means of which the responses 
can be reduced to scores lending themselves to more refined 
analysis than is possible with ordinary questionnaire replies. 


kd 
M. SUGGESTIONS FOR FURTHER STUDY, 


1. An excellent brief discussion of a careful experiment 
with questionnaires will be found in R. F. Sletto, “Pretest- 
ing of Questionnaires,” Amer. Sociol. Rev., 5:193-200, 1940. 
The student will find it worth his while to read this report 
in full. 

2. Two complete numbers of The Journal of A pplied Psy- 
chology (Vol. 23, February, 1939, and Vol. 24, December, 
1940), are devoted entirely to the techniques and results of 
recent research on the responses of audiences to radio broad- 
casting. Several excellent papers on experience with ques- 
tionnaires as well as other techniques will be found in the 
mentioned numbers of this periodical. 

3. On the use of questionnaires in market research, see 
The Technique of Marketing Research, (McGraw-Hill, 
1937), Chaps.g, 4, 11, and 15, by P. F. Lazarsfeld. 

4. On the wording of questions, ‘an excellent discussion 
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will be found in G. Gallup and S. F. Rae, The Pulse of De- 
mocracy, (Simon and Schuster, 1940), Chap. 7. 

5. For an excellent review of the kinds of bias that may 
creep into public opinion polls, see a statement by Professor 
Ross Stagner in the Congressional Record for May 9, 1941, 
pp- 3923-25. For a more detailed statement by the same au- 
thor, together with a reply by G, Gallup, see “A Comparison 

¿of the Gallup and the Fortune Polls Regarding American 
Intervention Policy,” Sociometry, 4: Sept. 1941. 

6. Wilson S. Miller and E. J. Engquist, Jr., “On the Effec- 
tiveness of Follow-Ups and Mail Canvasses”; Bulletin, Amer. 
Statis. Assn., pp. 189-190, Jan. 1942 


CHAPTER VIII 


THE MEASUREMENT OF ATTITUDES AND 
OPINIONS 3 


¢ pi 

I see no more drawbacks in thë path of setting up a branch of government 
whose job it is to determine by proven methi of selective sampling what | 
the people want than there were in the path of setting up any of the existing - 
branches of government... If such a branch were set up it would obviously 
have to be run by people with the same high ideals and sense of detached 
objectivity which are supposed to and in the minds of many do characterize 
the Supreme Court. I see no reason to suppose that a man automatically 
loses his integrity and his sense of objectivity when he is appointed to high 
office. Such a department could act as a sounding board for the public to 
inform its elected representatives more fully as to their desires than their bal- 
lot box vote could do. The public would then have an opportunity to inform 
its elected representative that he was elected because o the virtues of his 
planks 1 through 6, and despite the vices of plank 7 and thus admonish him 
not to regard his election as a mandate for putting this undesirable plank 7 
into law. — Elmo Roper.t 
A. PRACTICAL ASPECTS OF THE MEASUREMENT OF ATTITUDES 


AND OPINIONS 


Whether one agrees with the above statement or not, it 
should serve to emphasize the practical implications of the 
interest in the measurement ofyattitudes and opinions. At- 
tempts to gauge attitudes and opinions are as old as society 
itself. In small primary groups this measurement could be 
carried on with sufficient reliability through informal per- 
sonal contact. With the development of rapid means of 
communication and the consequent interdependence of re- 
mote groups in a national or an international society, the 
demand for more formal and systematic methods of measur- 
ing opinions has arisen. Political and social leaders rely on 
various techniques under these circumstances. Some main- 
tain a network of political agents reaching into eyery local 
mesial In some places these function openly in their 
„avowed capacity, or even as a type of social worker. In other 
places, they take the form of the secret police. Under rela- 

tively stable*conditions these functionaries tend to be sup- 
_ plemented by a vast array of self-appointed journalists, news- 
papermen and propagandists who undertake to “reflect” 


1“Sampling Public Opinion,” pn Amit Statis. Asso., 35:383, 1940. 
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opinion. - All of them are engaged in estimating, reporting, 
and trying to influence opinions and attitudes. 


B. DEFINITIONS OF ATTITUDES AND OPINIONS 


We are interested in this chapter not so much in the prac- 

~~ tical aspects of attitude and opinion measurement as in the 
e scientific problem of how to observe reliably and to general- 
ize from a subtle type of social behavior. The behavior 
which we define as attitudinal or as an attitude is a certain 
observable “set” of the organism or a reaction tendency pre- 
paratory to, and indicative of, more complete adjustment. 
In the present discussion, we shall use the term “attitude” 

to denote the general set of the organism as a whole toward 
an ene or situation which calls for adjustment.? As such, 


it includes all the neural and other physiological sets and 


postures toward a situation, and their psychological corre- 
lates, commonty designated I by such terms as “inclinations, 
feelings, prejudice, bias, notions, ideas, fears, threats and con- 
victions.. Thus a man’s attitude about pacifism means here 
all that he feels and thinks about peace and war,” ® as evi- 
denced. by some objectively observable behavior primarily of 
a preparatory, symbolic, or »partially effective nature.4 In 
short, studies of attitudes are concerned with the so-called 
subjective side of human behavior. (See Chapter I, Sec. G.) 

For convenience in discussion it is well to distinguish two 
types of attitudes: (1) The various overt bodily postures or 
muscular tensions preparatory to some overt adjustment 
such as, for example, the readiness for defense, flight, grasp- 
ing objects, etc. (2) The emotional or intellectual (“inner,” 
‘psychic”) attitudes, which are observable (a) through our 
own kinesthetic and subvocal responses, and (b) through the 
symbolic, behavior — mainly language — of others. 

2¢f.L. L. i à 

E “Thuestone “Ativuder ene pan i rot. for, Sociol., 33:529- 
54 1928. For an excellent analysis ofthe hopeless confusion in current usage 


of the term “attitude,” see Read Bain, “An Attitude on Attitudes,” Amer. 

Tour. Sociol., 33:940-47, 1928. 

an ear. op. cit, p. 246. G. W. Allport gives the following definition : 
n attitude is a mental and neural state of readiness, organized through ex- 

Leen ye exerting a directive or dynamic influence upon the individual's re- 

ponse to all objects and situations with which it is related.” Handbook of 


Social i ; = Oe 
Siida eae » (Edited by C. Murchison, Clark Univ. Press, 1935), p- 810- 
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Undoubtedly, one difficulty in attitude measurement is 
the variety of phenomena the term is used to denote. Sym- 
onds® records seven ways in which the term attitude is em- 
ployed in current literature, viz., (1) great organic drives, 
(2) muscular set, (3) generalized conduct, (4) neural set or Ww 
readiness to adjust, (5) emotional concomitant of action, (6) 
feeling concomitant of action, (7) accepting or rejecting ver- » 
bal responses. Insofar as any or all of these phrases describe, “ 
objectively observable phenomena they are legitimate sub- 
jects for study and measurement. However, our techniques 
of observing, classifying, and generalizing in the case of some 
of these different types of data are more highly developed 
than in the case of others. The type which falls perhaps pe- 
culiarly within the domain of the social sciences is that of 
verbal behavior, (Class 7 in Symonds’ list). For the most 
part, this behavior takes the form of opinions. The present 
chapter, therefore, is chiefly concerned with methods of 
measuring verbal behavior which takes the form of opinions 
and their possible correlation with other forms of behavior. 

LaPiere? has helped clarify the nature of attitudinal be- 
havior by the following classification of all human behavior 
into four phases or levels : 


(1) Overt-symbolic, which includes the acts of speaking, writing, 
and gesturing; (2) overt-nonsymbolic, which includes such di- 
rectly significant acts as driving a car and closing a door; (3) 
covert-symbolic, or what is commonly designated as thought; 
(4) covert-nonsymbolic, or what is usually described as feeling- 
states and emotions. 


Class (1) is clearly the phase with which most attitude stud- 
ies hitherto have been concerned, and will be our chief 
concern in this chapter. Class (2) refers to a phase of be- 
havior which most students would probably declare non- 
attitudinal. Classes (3) and (4) are perhaps generally re- 
garded as attitudinal, but are by some considered not subject 
to measurement. We shall consider this question in the 
next section. | 

6p. M. Symonds; “What is an Attitude?” Psych. Bull., 24:200-01, 1927. 
See also, G. W. Allport, op. cit., pp. 806-10, for an excellent summary of pres- 


ent uses of the word “attitude.” 
IR. T. LaPiere, “The Sociological Significance of Measurable Attitudes,” 


Amer. Sociol. Rev., 3:175-82, 1938. 
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The study of attitudes, as defined above, is important in 
three ways. First, it is important from the standpoint of 
successful and peaceful living with other people. Obviously 
it is of great importance to know the emotional and intellec- 

> tual behavior-pattern, ideologies, etc., of a group before for- 
mulating any plan for motivation, organization, or control 
i of the group. It is important “to be able to recognize the 
> © significant types of attitudes in an adjustment situation where 
it is necessary to guide or divert responses. ‘The orator, actor, 
salesman, advertiser especially, must be acquainted with the 
language of attitudes if he wishes to gauge successfully and 
control adequately his efforts.”* Second, a knowledge of 
the processes by which attitudes are built up is important 
from the standpoint of social control. “The social and edu- 
cational psychologists desire to know ‘how the attitudes are 
built up and what environmental pressures — educational 
and social—to bring to bear upon individuals and groups 
of individuals to build up the desired response mechanism.” ° 
There is obviously great practical usefulness in a reliable 
method of determining and of measuring attitudes, because 
it would give us a measure of predictability of concrete be- 
havior “without going to the elaborateness of studying Con- 
crete life conditions, for after all, the ultimate aim of per- 
sonality-measurement is certainly the devising of short-cuts 
to the prediction of human behavior, and thus to afford a 
means of controlling it.”1° Third, the principal method of 
defining true social groups may be to discover what popula- 
tion groups have common attitudes, rather than assuming 
that they constitute social groups merely because they live in 
the same geographic area, or political subdivision, or receive 
an equal income, or because of other criteria which may be 
largely irfelevant as indicators of true group boundaries or 
“fellow feeling.” In short, the attitudes of a group, espe- 
cially in the form of symbolit behavior, constitute a large 
and important part of the phenomena necessary to describe, 
‘ explain, predict and control the behavior of that group. 
_ §Bernard, op. cit. See also, D. Katz, “Attitude Measurement as a Method 
in K erteni eai 15:479-82, 1937- th 


10 Kimball Young, “The Measuremen i its,” Publ. 
Aimer, Sell. pag oie prom t of Personal and Social Traits,” Pu 
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C. THE MEASURABILITY OF ATTITUDES 


It is the apparent intangibility of such phenomena as atti- 
tudes that has given rise to the feeling that such data are not 
observable or measurable. If so, this interposes_a serious 
obstacle in the way of a very exact science of group behavior. 
But, as we have seen, more critical examination reveals that, 
the difference between such phenomena and “physical” phet” 
nomena is more apparent than real. 

An attitude as defined above is obviously a complex and 
many-sided phenomenon. To argue that it can be wholly 


described by a single numerical or other index would indeed 
be a dangerous contention. But as Thurstone has pointed 
out, “for the proble of measurement this statement is an- 
alogous to the observation that an ordinary table is a com- 
plex affair which cannot be wholly described by any single 
numerical index. So is a man such a complexity which can- 
not be wholly represented by a single index. Nevertheless 
we do not hesitate to say that we measure the table... The 
context usually implies what it is about the table that we 
propose to measure (¢.g., its height, its cost, or beauty, or 
degree of appropriateness, or the length of time required to 
make it)... We say without hesitation that we measure a 
man when we take some anthropometric measurements of 
him. The context may well imply without explicit declara- 
tion what aspect of the man we are measuring, his cephalic 
index, his height or weight or what not.” 11 In the same 
way, if we specify or imply by the context what aspect of 
people’s attitudes we propose to measure, it is just as per- 
missible for us to say that we are measuring attitudes as to 
say that we are measuring tables or men. 

This view also meets the objections that have ‘Been raised 
to the idea of measuring ¢overt-symbolic behavior, com- 
monly designated as “thought,” and covert-nonsymbolic be- 
havior, usualy called “feeling-states” and “emotions.” Take, 
for example, the statement: “The psychogalvanometer may: 
reveal changes in a criminal’s feeling-states, but what any 
person is actually feeling and actually thinking cannot be 


11 L. L. Thurstone, op. cit., pp- 53% 531+ 536. 
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measured and is still anybody's guess.” 1? This statement 
seems to overlook the fact that no measurements ever measure 

all aspects of any phenomenon. Measurements aim only to 

record reliably a single aspect, namely, the aspect that is spe- 

cified in the construction and legitimate use of the measuring 

inst E If so, we may point out that if the 
_-psychogal jometer “reveals changes” consistently and re- 
` liably in a criminal’s feeling-states, the observable concomi- 
tant fluctuations in the two is of the essence of measure- 
ment. The significance of this consistent fluctuation for 
stated purposes, and the further correlation of the psycho- 
galvanometer readings with still other phenomena in which 
we are interested, are questions to be answered in each case 
by actual research on that problem. Likewise, the statement 
that “what any person is actually feeling and actually think- 
ing . . . is still anybody's guess” is undoubtedly largely true, 
especially if by measurement of thought and feeling is meant 
that the measurer must actually himself experience the 
thoughts and feelings of the person he is measuring. But 
this is surely a preposterous requirement which could not 
be met by any measurement in any field. 

For our purposes it is enough to point out that “guessing” 
(“judging,” “estimating,” etc.) what other people think and 
feel is a daily pursuit of almost everybody, and that some 
are very much better guessers than others. We are interested 
in discovering and improving still further the technique of 
good “guessing.” We do not expect that any one technique 
will ever enable us to guess all that voters thought about a 
certain candidate. We are content to be able to “guess” 
only, for example, that on the basis of our measurement of 
what they thought on a certain subject, as revealed by cer- 
tain Nek. arii a a certain proportion (within one-half 
of one percent of the actual proportion, for example) would 
Vote a certain way on a certain day. It is on the basis of 

12 i ‘ : 
tation that this ecient aRicle is Incesmpcnibig Sith the creat ot the 
sows dae age The main thesis of LaPiere’s paper is the unjustifiability of 
pir ne er Peres ee evidence, any necessary correlation bes 
sized too strongly, and is treated As poe bat, iy ogi carl 


18 “What is here significant is that all compari i 4 
a è parison is of the nature of meas 
urement.” John Dewey, Logic. The Theory of Inquiry, (Holt, 1938), p. 202- 
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such mundane performances that the question of what is 
or is not measurable will be settled in the end. A priori 
declarations on what is or what is not measurable, will con- 
tribute nothing to knowledge. But science has assumed in 
the past as a working hypothesis that some things not meas- 
ured at a given time are not necessarily immeasurable if the 
proper instruments are developed. Scientists must continue 
to work on this hypothesis. p 

Now the particular form of attitudinal behavior which 
interests us most from the standpoint of sociology is that 
which serves.as a stimulus-response mechanism between peo- 
ple, and which significantly affects their social interaction 
and their adjustment to their environment. By far the 
greatest and most important part of this attitudinal behavior 
is in the form of language. In its more complete form this 
behavior takes the language form of opinions. The princi- 
pal aspect of attitudes which we are interested in measuring, 
therefore, is that aspect which takes the form of opinions 
expressed in language. 


D. THE RELATION OF VERBAL ATTITUDES TO OTHER BEHAVIOR 


In how far is significant measurement of mere opinions 
or other language expressions an index of the general “set” 
of the organism? What we are ultimately interested in, it 
may be argued, is complete overt adjustment in a concrete 
situation, and this behavior is frequently and notoriously at 
variance with people’s vérbal professions and opinions. 
While it is true that one of our interests in attitudes is the 
clue they may afford to the complete adjustment of the 
individual or the group, this is not their only E 
Whether a group of people actually does or does not practice 
permanent monogamy, the fact that it verbally declares its 
belief in this practice is still an aspect of its social behavior 
which is of vast significance. Its verbal expressions on the 
subject become perhaps the most important pressure of the 
social environment enforcing the monogamic sex mores in 
,the community. The measurement of opinions, therefore, 
is of itself a matter of importance. 

The fact that opinions or verbal behavior are frequently 
very unreliable guides to what a person would do in a con- 


-3 
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crete situation, does not destroy the value of the verbal be- 
havior as an expression of attitude. It has been suggested 
that a person’s actions (meaning his more complete, overt, 
public, and non-verbal adjustments) are a safer indication of 
his attitude than what he says. But overt actions are fre- 
quently ore to distort or conceal private, covert atti- 
tudes quite as fully as public verbal behavior sometimes con- 
‘ceals private attitudes. When the politician kisses Negro 
babies as an expression of his friendly attitude toward the 
Negro race his action is probably no more an accurate index 
of his “private” attitude than is his impassioned declaration 
of friendslfip from the platform. Both are just as “true” and 
real attitudes in their respective situations as the views he 
expresses or the action he takes toward Negroes after elec- 
tion, The situation in each case is different. The point is 
that our knowledge of attitudes can come only through a 
study of behavior, and all behavior is subject to modification 
in the process of execution from considerations of courtesy, 
expediency, or other social pressures. The attitude which a 
person exhibits, whether through verbal or other behavior, 
in a given situation, represents his attitude in that situation. 
How his responses in a hypothetical or symbolic situation 
correlate with his behavior in a similar concrete situation 
involving complete overt adjustment can be determined only, 
by a study of both kinds of behavior in their respective sit- 
uations.* We can make the index reliable only through a 
careful study and analysis of the conditions under which the 


14 LaPiere, (op. cit.), gives striking illustrations of the lack of correlation 
bet verbal and other behavior in some concrete situations. The student 
is | to read this article as a caution against unwarranted assumptions 

pinion questionnaires and other verbal tests, See also his “Attitudes 
us. Actions,” Soc. Forces, 13:230-37, 1934. Much confusion has arisen from 
the use of sı terms as “true” and “real” attitudes to designate the relatively 
manent, integrated, private, and covert attitudes, ie., what a man “thinks 
n his heart," as contrasted with “false” or “misleading attitudes exemplified 
by rationalizations, conventional opinions, and public expressions of attitude 
conforming to the mores, All attitudinal behavior is, as such, equally “true” 
and “real. „That is, the attitudes expressed in pencil and paper tests are as 
true” and “real” as what one thinks privately or what one does overtly in a 
specific situation. (For examples of the use of such terms as “real”: and 
true” attitudes, see the cases cited in G. W. Allport, op. cit, p. 809.) . The 
ee ow thee yon behaviors correlate with each other is an im? 
u or study, but has i A, ” or “reality” 
ety Sy ot for s go Man no bearing on the “truth” or “reality” of 
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behavior takes place—the whole situation—and from this 
_infer whatever neuro-psychic or neuro-muscular set of a rela- 
' tively permanent kind which we choose to hypothesize and 
call the attitude. All scientific truth is inferred in this way 
from the observation of objective behavior. 

The technique of determining degree of reliability of at- 
titudinal behavior as an index to complete overt adjustment 
behavior in a concrete situation would consist of correlat- 
ing the attitudinal behavior toward a situation with the com- 
plete overt adjustment behavior. Broadly speaking, we may 
say that verbal behavior is likely to be a reliable index of 
other behavior when the behavior in question is not’strongly 
governed by the current mores. Replies to the question, 
“Do you like baseball?” are likely to be “true” because in 
the current American mores there is no taboo or any special 
virtue attaching to either a positive or a negative attitude or 
behavior on the subject. On the other hand, a question 
such as, “Do you favor extra-marital sex relations?” might 
yield results very much less “reliable” in the sense here un- 
der discussion. Chapin has suggested that a correlation 
between (a) the speech reactions, (b) “total overt” behavior 
and (c) past (historical) behavior of a person or group would 
reveal the degree to which one or two of these factors are 
an’index of the others. If such correlations were extensively 
carried out they might become the basis of legitimate gen- 
eralizations as to the degree of correlation to be expected 
between two or more aspects of behavior relative to given 
situations. But in the meantime we should (1) make clear 
exactly what aspects of attitudinal behavior we expect 
study, and (2) limit our generalizations to the type of 
havior studied. 

The term attitude, then, will be employed in the*present 
discussion merely as a blanket symbol for a hypothetical “set 
of the organism toward the object or situation to which ad- 
justment is called for” 1° or a reaction tendency “preparatory 
to overt adjustment.” 17 But as Bernard points out, “it can 
.19F, S. Chapin, Cultural Change, (Century, 1928), pp. 433 Œ. 


16 L. L. Bernard, op. cit., p. 246. 
17 Ibid., p. 248. 
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be known and communicated to another only through its 
overt symbolic responses.” 18 Hence, the only scientific ap- 
proach to the study of attitudes is the selection of some or 
all of these responses —speech, bodily postures, or others — 
and a description of them under given conditions. 

With these considerations in mind, we may proceed to a 
consideration of the types of measurement that have been 
devised to measure attitudes. 


E. METHODS OF STUDYING ATTITUDES 


As indicated above, there are two general types of ap- 
proach to a person’s social attitudes : (1) We may study as far 
as possible all of a person's past and present overt non-verbal 
and verbal behavior in given situations, and from this behav- 
ior infer his attitude ; or (2) we may infer his attitude from 
a study of his verbal behavior in the form of specific opinions. 

We are constantly imputing certain attitudes to people on 
the basis of their general behavior. For example, we make 
inferences regarding a person's religious and moral attitudes 
from his habits of church-going, his reactions to profanity, 
the books in his library, the pictures in his home, etc. We 
may be doing so systematically and for scientific purposes, or 
merely for purposes of our own or his immediate needs of ad- 
justment. In either case, the more complete our information 
of a person’s past behavior and the conditions under which 
it took place, the more accurately we can gauge his attitudes. 


The principal types of sources of such data may be classified 
as follows: 


+ 1. Life History Documents (Chapter XI) 
a) Biographies 
b) Case Histories (Social work records) 


. 

18 7bid, There is, of course, no objection to employing the term “attitudes” 
to indicate subjective states or Paha a rn pr my Sar resent technique, 
only to introspective study. As such, the term may be valuable as an hypotheti- 
cal concept and may have cognitive value. See Ellsworth Faris, “Attitudes and 
Behavior, Amer. Jour. Sociol, $4:271-81, 1926. Also Pitirim Sorokin, Con- 
aa raia ragian er Theories, (Harper, 1928), pp. 655- Also C. H. 

h Roots of Social Knowledge,” Amer. Jour. Pan A nafi -Bo, 1926. 

Much of the controversy on this subject is due to a misunderstanding on the 
ao the terms “subjective” and “introspective.” For an important discussion 
— objectivity of the symbolic and so-called subjective states 

a a . Markey, ae ecg Process, (Harcourt, 1928), pp. 144-48. See also, 
. A. Lundberg, Foundations of Sociology, (Macmillan, 1939). pp. 10-34- 


MEASUREMENT OF ATTITUDES 221 


(c) Letters 
(d) Diaries or memoirs 
2. Oral interviews (Chapter XI) 
§- Questionnaires and polls (Chapter VII). 


It should be observed that the above classes are not mu- 
tually exclusive. Life history documents, for example, might 
be secured through oral interviews or through Meson: 
naires, as well as through collection of the documents as pre- 
pared for other purposes remote from any thought of their 
use as scientific data. In this respect, however, the case his- 
tory as prepared and used by social agencies perhaps resem- 
bles a kind of inquiry comparable to that of the interview 
and the questionnaire rather than to that of letters, diaries, 
etc. The fact is that all sociological inquiry necessarily deals 
with segments of social living, and the above categories are 
merely crude designations of very general and not mutually 
exclusive ways of securing and analyzing samples of such 
life-segments. We shall devote ourselves in this chapter 
chiefly to methods of studying attitudes through the methods 
of polls, tests, and scales, and leave the other techniques to 
other chapters (Chapters VII, IX, X, XI). 


F. ATTITUDE QUESTIONNAIRES AND POLLS 


The preceding chapter dealt with questionnaires designed 
to determine matters of fact as well as matters of opinion. 
The principles there set down will also apply to many of the 
ways of determining attitudes and opinions to be discussed 
below. The extensive use of the “Yes,” “No,” “No opinion” 
type of question with variations to suit the type of inqui 
in the national public opinion polls has been accompanied 7 
by careful study and pretesting of this type of attitude test. 
The experience has yielded many data regarding the value 
of questions of this type and the importance of care in their 
construction.® Although these questions are administered 
for the most part through interviews, the principles govern- 
ing their formulation apply equally to either written or oral 
tests. 

19 See G. Gallup and S. F. Rae, The Pulse of Democracy, (Simon and Schus- 
ter, 1940), Chap. 7. 
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Examples of poll questions : 


1. “Should college teachers be free to express their views on all 
subjects, including politics and religion? (May, 1936).*° 


Yes 59% 
No 41% 


100% 

2. “Do you approve of a married woman earning money in 
industry or business if she has a husband capable of supporting 
her?” (November, 1936).*4 

No 82% 
Yes 18% 


-& 


pi 


100% 
3. “To what social class in this country do you feel you be- 
long? Middle class, upper class or lower class?” (April, 


1939). 
Upper 6 
Middle 88%, 
Lower 6% 


100 
4. “Which way do you think reli should be given —in the 
form of work relief (such as a W.P.A. job), or as direct cash re- 
lief?” (May, 1939). 


Work relief 8 
Cash relief ue 
Q 100% 

No attempt is made in this type of attitude study to ex- 
press individual scores by a number. The results are inter- 
preted entirely in terms of the number or percent of people 
responding in a particular way to a single statement, phrase, 

any other attitude indicator. This method of study is 
advantageous and popular because of the ease and positive- 
ness with which the results can be manipulated statistically, 
and the comparatively brief task it usually imposes upon the 
informant. Its chief disadvantage lies in the difficulty of 
formulating questions on which a true attitude can be ex- 
pressed in an absolute and unqualified affirmative or nega- 
tive. Actual attitudes tend to differ by infinitely small shad- 


® Ibid., p. 310. 
2 Ibid., p. 309. 
23 Ibid., p. 309. 
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ings and gradations. Thus in such questions as the first three 
above, some people would hesitate to commit themselves en- 
tirely without reservation. One way of avoiding the need 
for reservations is to phrase the question in sufficiently ex- 
treme form so that most people could without reservation 
indicate their attitude by “Yes” or “No.” To the extent that 
this remedy is resorted to, however, the purpose of the inves- 
tigation is largely defeated. For we are usually interested 
in attitudes not on questions on which we already know the 
unanimity to be great, but on the finer gradations of atti- 
tudes. That is, we are interested chiefly in attitudes on 
“borderline” questions — questions on which there is much 
reasonable difference of opinion.” On such subjects it is 
frequently very difficult to formulate a question that can be 
answered with an unqualified “Yes” or “No.” 

To meet the above objection, a third category, “doubtful,” 
is frequently provided as a third possible answer. In order 
to afford more adequate opportunity to express gradations of 
attitude in this type of study various forms of arbitrary scales 
have been invented. 


G. ARBITRARY SCALES 


The devices for measuring attitudes to be considered in 
this section are called arbitrary scales because the distances 
between the steps or units depend on arbitrary assumption 
rather than on experimental determination. Some arbitrary 
rules of scoring are also usually adopted in these scales so 
that responses of different individuals are reduced to a cer- 
tain abstract numerical score. These scores in relation to 
each other represent the relative degrees to which each per- 


son is supposed to hold a given attitude. It is only when 


they are so scored that they constitute a scale. The answers 
to many of the questions given below, for example, could be 
handled statistically as in the case of the polls considered 
above. That is, the percentage of a population answering a 
certain way could be computed. If so, these same tests 
would belong in the previous classification rather than in the 
class of scales, 


3 See Stuart A. Rice, “The Political Vote as a Frequency Distribution,” 
Four. Amer, Statis. Asso, 19: (NS.), 70-75, 1924- 
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Four types of arbitrary scales, each with a number of sub- 
types, can be distinguished and illustrated as follows : 

1. Point scales. These are characterized by the fact that, 
in scoring, one point is usually given for each agreement or 
disagreement with a statement, or words, or situations as 
described or pictured. The following is an example of a 

+ common type: 
The cross-out test. Example : 


Directions: Read through the words listed below. Consider 
each one quickly. If it suggests more that is disagreeable than is 
agreeable, cross it out. You may cross out many or few words. 
Work as rapidly as you can, but be sure you cross out every word 
which is more annoying than pleasing, more agonizing than ap- 
pealing, more distasteful than attractive. 


Bolshevist Dancing 
Mystic Prayer 
Sunday Blue Laws Birth Control 
Ku Klux Klan I. W. W. 


(The complete test has 51 words.) 


This type of test illustrates what is perhaps the simplest 
method of ascertaining attitudes through verbal tests.2° The 
method is based on the principle that a tendency to cross out 
an unusually large number of words as annoying or distaste- 
ful is an indication of a general emotional set, or at least in- 
dicates an attitude on the symbolized subjects. To the ex- 
tent that the simplicity of this test secures a spontaneous re- 
action, and to the extent that the conditions under which it 
1s given are such as to reduce to a minimum the tendency of 
the person tested to be influenced by considerations of what 
others will think if he respohds entirely spontaneously, this 

* method of discovering attitudes undoubtedly has some de- 
gree of validity. The latter condition can perhaps be best 
approached by giving the test in a large group, and by not 


requiring the person taking the test to attach his name or 
other identification mark to the paper, 


% The above illustration, from G. B. Watson, The Measurement of Fair- 
Mindedness, (Teachers College, Columbia University, 1925), also contains 
data on the reliability and the validity of the test. See Chap. 4 and pp. 89-91- 
= other and more recent examples of its use, see R. Stagner, “Measuring 

elationships among be By Public Opinion Quarterly, 2:622-27, 
1938. Also R. Stagner, “ Out Technique as a Method in Public 
Opinion Analysis,” Jour, Soc. Psychol., 11:79-90, 1940. 
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Two considerations affect the validity of the results achieved 
by this type of test. First, it is not always easy to select a list 
of words sufficiently suggestive of at least two specific atti- 
tudes. Many thoughtful individuals will find it difficult to 
do themselves justice when confronted with the list because 
they may wish for certain reasons to cross a word out, and for 
certain other reasons to leave it unchecked. When a large 
number of words of the same general implication or mean- 
ing are crossed out, however, it is probably a safe, though 
rough, indication of unfavorable attitude toward the things 
suggested by the words crossed out. Second, in many situa- 
tions, attitudes cannot be expressed in a singlé word or phrase. 

Stagner has used the test recently in studying configura- 
tions or associations between attitudes. From a test of forty 
items including such words as labor union, socialist, capi- 
talist, spiritualist, Holy Communion, and other phrases 
identifying social, religious, and political institutions or 
ideologies, tetrachoric correlations were computed showing 
to what degree the dislike of a ‘particular idea like capitalist 
is associated with dislike of such words as Republican, Wall 
Street, etc. Because of the limitations noted above, however, 
the cross-out test is not among the more frequently used tests. 

Examples of another “point scale” type of attitude test are 
the more frequently used lists of statements in which the 
score is simply the sum of the endorsed statements,** and 
others which are scored by summing both the endorsed and 
rejected items in the test.” In the latter case, the statements’ 
are first classified into broad categories, such as “radical” and 
“conservative.” The score is the sum of all the conservative 
statements endorsed plus all the radical statements with which 
the respondent disagrees. All of the “point” scales are useful 
for some purposes, although they make no attempt to regis- 
ter the gradations in intensity of attitude which some of the 
more refined scales attempt. 

2. Ranking scales. These consist of a series of words (¢.g., 
nationalities, occupations, etc.), statements, or pictures of peo- 


26 C. Kirkpatrick, “A Methodological Analysis of Feminism in Relation to 
Marital Adjustment,” Amer. Sociol. Rev., 4:325-34, 1939- Nea 

27 T. F. Lenz, “Generality and Specificity of Conservatism-Radicalism,” 
Jour. Educ. Psychol., 29:540-46, 1938. Also È. Nelson, “Radicalism-Conserv- 
atism in Student Attitudes,” Psychol. Monograph., 50, No. 4, 1938. 
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ple or situations. The persons whose attitudes are being de- 
termined sare asked to rank the material in order of prefer- 
ence as indicative of their attitude.28 The results may be 
scored in various ways. Usually ranking tests are scored ac- 
cording to the degree to which the responses agree with a pre- 
determined assignment of attitude value to each item. The 
responses may also be scored according to the degree to which 
they deviate in a significant manner from chance expectancy. 

Example : Horowitz tested racial attitudes by securing 
twelve photographs, eight of Negro children and four of 
white children who had been judged for racial typicality and 
general pleasantness. The photographs were presented to 
school children—kindergarten through the eighth grade— 
with the instruction: “Pick out the one you like best, next 
best, next best . . .” and so on until the pictures were all 
ranked. Scoring was by summing the ranks assigned the 4 
white faces. If these faces were ranked 1, 2, 3, 4, the score 
was 1+ 2-++3-+-4=10; if they were ranked 1, 2, 5, 9, the score 
was 17. The lower the score, the greater the preference for 
the white faces. The possible range was from 10 to 42. If 
the ranking had been done by chance, the score might be ex- 
pected to be 26. Deviation from the chance expectancy may 
be taken as indicative of the operation of a bias.” 3 

As a variation of the above, the children were asked to 
select from the same twelve photographs companions for a 
variety of imagined situations. For twelve different situa- 
‘tions the children were asked such questions as the following: 


“1. Show me all those that you want to sit next to you on 
a street car 


4. Show me all those that you want to come to your 
party 

6. Show me all those that 
for lunch 


28 With reference to the optimum number of ranks to ask for, Hazel Gaudet 
("The Favorite Radio Program,” Jour. Applied P. chol, 23:115-26, 1999). €X- 
Presses the opinion that three choices the results. The number 
| ag Varies according to the subject-matter. See also, G. B. Hotchkiss 

Franken, The Measurement of Advertising Effects, (Harper, 1927), 


Horowitz, The 


aa ee of Attitude Toward the Negro (At 
be oe nR Columbia University, New York, 19365). e 
+ P- to 


you want to go home with you 
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7. Show me all those that you want to sit mext to in the 


movies » 
11. Show me all those that you want to live next doorgto 
you 


“Scoring the Show Me test was by finding the relative fre- 
quency with which the white faces were selected for all ac- 
tivities, expressed as a percent. The number of companions 
selected for all twelve items of the test was determined, then 
the number of those selections which were of the white boys 
(maximum, of course, 48) ; the frequency of the white choices 
was then computed as a percent of the total*number of se- 
lections. Since of twelve faces, four were white, if the selec- 
tions of companions were on a chance basis, the score might 
be expected to be 334%. Deviation from the chance ex- 
pectancy may be taken as indicative of the operation of a 
bias. 

“On these two tests, the interpretation of the scores is 
this: on the Ranks test scores below 26 show preference for 
white as compared with Negroes (the smaller the score, the 
greater the preference) ; on the Show Me test scores above 
334% show prejudice for white as compared with Negroes 
(the higher the score, the greater the prejudice against Ne- 
groes).” * 

Words, phrases, and statements rather than pictures can 
be ranked, of course, and the results scored in the same man- 
ner. Indeed, the use of pictures is the exception rather than 
the most common practice.**? The use of photographs or 
motion pictures of people and situations may be of great 
value in groups to which spoken or written language is of 
doubtful meaning. On the other hand, such tests are rela- 
tively difficult and expensive to administer and, while the 
stimulus is non-verbal, the response is necessarily verbal, as 
in written tests. In the case of children or groups of mixed 
and varied languages, these tests have obvious value either 
alone or as supplements to other methods. 

3i Ibid., pp. 9, 10. 

82 For examples of ranking scales of the verbal type, see Helen Pallister, 
“Vocational Preferences of School-Learners in a Scottish Industrial Area,” 


Brit. Four. Psychol., 29 144-66, 1938. Also A. L. Porterfield, “Education and 


Race Attitudes,” Socio . and Soe. Res., 21:598-43, 1987- 


228 SOCIAL RESEARCH 


3. Rating (intensity) scales. In this type of instrument, 
steps on an attitude continuum are represented by responses 
(Such as “strongly agree,” “agree,” “disagree,” “strongly dis- 
agree”) to separate statements or other attitude indicators 
not necessarily related to each other or to the same subject. 
The statements constituting these tests are frequently identi- 
cal with those used in public opinion polls calling for re- 
sponses of “Yes,” “No,” “Doubtful,” or “No opinion.” In- 
deed, if the answers to the statements on an intensity scale are 
scored simply in terms of the percentage of a population ve- 
sponding to each degree of intensity, the rating type of arbi- 
trary scale should be classified rather as a type of opinion 
questionnaire or poll. When, however, arbitrary weights 
are assigned in scoring to each degree of intensity, such as 
1, 2, 3, 4, 5 in the example below, each statement becomes 
an arbitrary scale. 


Examples of intensity scales : 


(a) Directions: Please indicate your opinion about each of the 
statements below by drawing a circle around the letter or letters 
in the margin which express your judgment. This is what the 
letters mean : 

T-True eee) 
Papen or partly true 
-In Doubt, Divided, O ti 

Thin eek partly False oe 

*  FFalse (absolutely) 


T PT D PFF 11. The United States is justified in keeping 
the Philippines in order to insure our sup- 
ply of raw rubber. 


T PT D PF F 18. Japan sells more goods to the United States 
than to any other foreign country. 


TPTDPFF 26. It is Not justifiable for a loyal American 
izen to protest against any course of 
action followed deliberately by the United 

States Minister to China.” 


(b) In the interest of permanent peace, we should be willing 
33 For a p detailed discussion, -P , ods, 
La ill, 1996), Chap. g. see J. P. Guilford, Psychometric Meth 
illustration is taken from the < Pacific 
Relations, 129 E. send St, New York AA of the Institute of Paci 
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to arbitrate absolutely all differences with other “ations which we 
cannot readily settle by diplomacy.35 P 


Strongly Approve Undecided Disapprove Strongly” 
approve disapprove 


(c) In order to aid you in ex ressing your opinion more 
clearly, you will find below a scale which will enable you to 
check your degree of liking from +10, meaning “exceptionally 
good,” to —10, meaning Ri, r,” with zero representing an 
average amount of like or dis ike. 

“Referring to each program named above, would you please 
place a check on the scale at the point you feel represents your 
degree of liking for the program? Please keep the same order : 
Scale ‘a’ corresponds to program ‘a’,’” 36 

Exceptionally 
Very poor Average 


/ / / 
Zio = o +5 Fo 


In example (a) the first two statements are in less desirable 
form than the third. The first is confusing to many people 
in that it seems to contain two propositions: (1) that the 
United States is justified in keeping the Philippines; and 
(2) that the maintenance of our rubber supply is a valid 
reason for keeping the Philippines. When confronted with 
this type of statement people will frequently say: “I believe 
we are justified in keeping the Philippines, but not for this 
reason. How shall I answer?” If the point on which opin- 
ion is desired is the validity of the reason stated for our re: 
tention of the Philippines, as is clear from the statement as 
a whole, the possible ambiguity in interpretation mentioned 
above could be avoided by rephrasing the proposition so as 
to emphasize this; thus: “The maintenance of the raw rub- 
ber supply of the United States justifies us in keeping the 
Philippines.” 

The second statement under (a) is objectionable because it 
is a matter of statistical information rather than of opinion or 
attitude. To be sure, the question may be of value in show- 

35 From G. Murphy and R. Likert, Public Opinion and the Individual, 
(Harper, 1938), p. 19. The special type of scoring developed by these authors 
will be disc in a later connection. 

38 Hazel Gaudet, “The Favorite Radio Program,” Jour. Applied Psychol., 23: 


120, 1939. See also, H. E. Pemberton, “Optimum Rating Scale Public 
Opinion,” Sociol. and Soc. Res., 17:470-72, 1932. 
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ing how exaggerated is a person’s idea of the importance of 
our commercial relations with Japan. But since most peo- 
ple know that this is a question on which rather accurate 
objective information is available, they will tend to decline 
to express themselves unless they happen to know definitely. 
In short, answers to this type of question tend to reveal in- 
formation rather than the less objective attitude which we 
are trying to measure in attitude questionnaires. As a ques- 
tion on information, it is not, of course, objectionable. 

The last of the three questions under (a) above comes 
closer to revealing an attitude in the usual sense. It involves 
a question of personal attitude toward ethical or patriotic 
ideals, and as such falls more closely into the realm which 
attitude investigations are designed to survey. 

This type of test aims to discover people's attitudes by 
their tendency to hold extreme or positive views on doubt- 
ful issues. Its value will, therefore, depend largely on the 
care with which the statements are selected. They should 
be statements which are decidedly doubtful, impossible to 
verify objectively, and on which reasonable people may dis- 
agree. If this is true of the questions used, then an extreme 
view in either direction may properly be regarded as indica- 
tive of definite attitude or bias. Only the expression of the 
extreme view in either direction, therefore, is usually scored 
as indicative of definite attitudinal bias. 

This type of test has the advantage over the more arbi- 
trary “True” or “False,” “Yes” or “No,” type in that it al- 
lows for a gradation of opinion or attitude. The results 
therefore permit a tabulation of the attitudes with respect to 
degree of positiveness as well as to their negative or positive 
character relative to a situation. This is a matter of con- 
siderable importance in drawing conclusions from the re- 
sults, and is further emphasized in some of the examples that 
follow. 

The number of possible responses in intensity scales may, 
of course, vary from two to six or more. On this point one 
study ** revealed (a) that more people were willing to respond 
when a four-step response was permitted ; (b) that the re- 


HE. E Ghiselli, “All or None versus Graded Response Questionnaires.” 
Jour. Applied Psych., 23:405 f., 1939 er? 
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quirement of absolute choice between two mutually exclusive 
statements made the expressed opinions distinctly less favor- 
able on the issue tested (belief in the sincerity of advertis- 
ing) ; and (c) that the “‘yes-no” type of responses is unreliable 
as a measure of average opinion. Day reports that in current 
literature on the subject, seventy-seven percent of the scales 
are of the five-response type.*° Again, the stimulus to the 
response may be a general theoretical statement, a pictorial 
behavior situation or a verbal behavior situation.» The 
behavior situation test may be illustrated as follows: 


1. You are reading in a public library. A Negro comes in and 
sits down beside you. 

a. You rise and go to another table. 

b. You keep right on with your reading unconcerned. 

c. You complain to the librarian. 

d. You leave the library at once.‘ 

2. Your state needs an additional tax so that its budget can be 
balanced. If you could vote, and a tax bill which provided for a 
widely distributed general retail sales tax was submitted to the 
people of the state for approval, what would you do? 


a. Not vote at all. 

b. Vote for the bill. 

c. Vote against the bill. 

d. Vote for the bill and try to persuade others to vote for it too. 

e. Vote against the bill and try to persuade others to vote 
against it too. 


There is some evidence that attitudes expressed toward con- 
crete behavior situations have a higher reliability than the- 


38 Daniel Day, “Methods in Attitude Research,” Amer. Sociol. Rev., 5:895- 
10, 1940, Some recent examples of the five-response type of scale may be 
‘ound in: M. M. Smith, “Comparative Social Attitudes,” Four. Educ. Psychol, 

28:681-93, 1938; F. C. Sumner and A. $. Campbell, “Attitudes Toward the 
Administration of Justice,” Jour. Psychol., 8:23-52, 1939; E. G. Williamson 
and J. G. Darley, “The Measurement of Social Attitudes of College Students : 
I. Standardization of Tests and the Results of a Survey,” Jour. Soc. Psychol, 
8:219-29, 1937 ; C. N. Winslow, “A Study of the Extent of Agreement Between 
Friends’ Opinions and their Ability to Estimate the Opinions of Each Other,” 
Ibid., 8:433-42, 1987- - 4 

% For examples of behavior situation scales see Leigh Peck and Elma Beck- 
man, “An Experimental Study of Children’s Attitudes Toward Relief,” Jour. 
Soc. Psych, 8:205-18, 1937 ; A. C. Rosander, “An Attitude Scale Based u ) 
Behavior Situations,” Four. Soc. Psych, 8:3-15, 1937 ; and R. N. Ford, “Scaling 
Experience by a Multiple-Res Technique: A Study of White-Negro 
Contacts,” Amer. Sociol, Rev., 0:9-28, 1941. 

# A. C. Rosander, op. cit. É 

41C, R. Pace, “A Situation Test to Measure Social-Political-Economic Atti- 


tudes,” Jour. Soc. Psychol., 10:331-44, 1939- 
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oretical and opinion types of questions.** It is also held that 
a question involving a behavior situation is likely to be more 
valid as an index of actual behavior in such situations be- 
case it merely asks the direct and simple question, “What 
would you do?” This is perhaps true to the highest extent 
when the situation described is one which the responder has 
actually encountered, or which is not too fantastic or too 
foreign to his experience. 

4. Statement or unequal interval scales. These scales con- 
sist of statements arranged by one or a few persons in’ what 


appears to them to be a logical order from favorable to un- _ 


favorable or some similar characterization. The similarity 


of these tests to the public opinion poll and the: ranking y 


method on the one hand, and the experimental or equal 
interval scales on the other hand, willbe noted. ‘The state- 
ment scale, however, is usually scored by the scale number 
(which has been assigned in advance) of the cheéked state- 
ment or the central tendency of the scale numbers of two or 
more endorsed statements. In this way the statement scale 
differs from the statements of the polls and the ranking scale, 
which have not been assigned ranks or scale values in ad- 
vance. The difference between the statement scale and the 
experimental scales, to be discussed below, lies entirely, as 
we shall see, in the manner of selecting and arranging the 


statements. Several types of statement or unequal-interval _ 


i“ 


scales are in comimon use. 
Examples : zg “Rae yet 


one of five ways. Check whichever one of the ; 
approximates your own opinion. 
4. Income taxes should be levied: 
a. Only upon those with immense fortunes. 
b. Only upon those having ample means. 
£ “sed = the upper and middle classes. 
pon all exce à 
e. Upon all.4s Co 
“D. D. Da ~ 5 i 
and Opinion,” Soc and oe eas asipay tgp, mee WAT Anida 
41G. Murphy and R. Likert, op. City p- 1}. 


a 


_ (ap Directions: In each of the following you àe given 4 pre- 

liminary statement which can be completed or ihany 
ye most: closely ,- 
fo 


a 
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(b) The Social Distance Test.44 

According to my first feeling reactions I would willingly admit 
members of each racial group (as a class and not the best I have 
known, nor the worst members) to one or more of the oups 
under which I have placed a cross (X). (If you are who y un- 
familiar with any one of the racial groups, then no marks need 
be made. Note that crosses,may be put in any number of the 
first fiye columns for each racial group. Note that a cross put 
in either row 6 or 7 for a given racial group means that no 
other crosses should be given that group.) ; 


- 1.°To close kinship by 
J marriage. 


2, To club as per- 
sonal chums. | 


3. To my street as 
neighbors. 

4. To employment in 

> my scout pation. 

5. To citizenship in my 
country. 


6. As visitérs only to 
my.country, 


7. Would exclude from 
my country. n 


. This*device is merely a variation of the rating system in- 


volved in the illustration immediately preceding. It is de- 


signed to méasure the “social distance” between the subject 
and various nati ies or races. The same device can} of 
course, béused in relation to occupations, interests, and other 
social conditions. Its chief difficulty lies in the construction 


f: of the scale representing degrees of “distance” or other fac- 


tors measured. As it is, it represents a valuable me 
measuring relative attitudes.‘ 


44 Note that if the method of equal appesting intervals, to be described 
below, has been applied to the statements in the social distance test, the test 
should be classified as an experimental or equal-interval scale. For an exam- 
ple, see below, p. 242. y 

4* The example given above is from E. S. Bogardus, The New Social Re- 
search, (Miller, Los Angeles, 1926), p. 212. For a more recent treatment of 
this subject by the same author, see “Scales in Social Research,” Sociol. and 


Soc. Res., 24:69°75, 1939- ~ 
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H. EXPERIMENTAL SCALES 


These scales differ from the arbitrary scales discussed above 
in that the statements or other attitude indicators are sorted 
or separated by a large number of judges in such a way as to 
furnish a rational, experimentally determined basis for the 
attitude continuum which constitutes the scale. Three forms 
are in current use. 

1, The method of paired comparisons, In this method 
every item toward which an attitude is to be indicated is 
compared with every other item, one of the two being ranked 
above the other in each pair. The following example will 
illustrate the form of this test: 

H 


Example : 


This is an experimental study of group attitudes. You are 
asked merely to underline the one of each pair with whom you 
would rather associate. For example, the first pair is 

Englishman — South American * 


* The complete test contains similar pairs of every nationality paired with 
every other, care being taken that no nationality is repeated twice in succes- 
sion or in any other rhythmical pattern, and that every race appears an equal 
number of times on the right and the left of the pair. 


If, in general, you prefer to associate with Englishmen’ rather 
than with South Acdiricion, underline Englishman, If you pre- 
fer, in general, to associate with South Americans, underline 
South American. If you find it difficult to decide for any pair, 
simply underline one of them anyway. Be sure to underline one 
of each pair even if you have to guess.45 : 
Calculation of scale values is based on the proportion of 
persons choosing one particular object or statement of atti- 
tudefas against its alternative. These judgments with refer- 
ence to a given object or statement tend to distribute them- 
selves according to the normal probability curve, giving for 
each item its discriminal dispersion. The standard devia- 
tion of this dispersion is the unit of the scale constructed 
on the basis of the judgments of all the pairs. The full de- 
tails of the method of constructing such a scale cannot be 
given here, as the method is laborious and requires consid- 
erable space for adequate elaboration. A lucid exposition of 
the subject, together with several variations and labor-saving 


€H. Meltzer, “Group Differences ationality Preferences of 
Children,” Sociometry, 2:88, 1939- > n n 
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devices, will be found in J. P. Guilford’s excellent text. 
The method of paired comparisons shows high reliability 
and validity as conventionally estimated and, with the excep- 
tion of the method of equal appearing intervals, must be 
regarded at present as the most rational and reliable of all 
techniques of constructing attitude’ scales. As a matter of 
fact, the results of these methods are highly similar, except 
that in the method of paired comparisons items having a scale 
difference of an irrelevant nature may remain undetected.** 
It is true that in many cases equally reliable results can be 
secured through less laborious methods, and that for some 
purposes less accurate methods may be sufficiently good. 
The fact remains that we frequently do not know how gen- 
erally this similarity of result secured by the simpler meth- 
ods can be depended upon to occur. Scales constructed by 
the method of paired comparisons may serve, therefore, as 
valuable criteria of validity against which to check less ac- 
curate and less dependable methods. 

2. The order of merit method.‘s This type of scale is 
identical with the ranking method as described above under 
arbitrary. scales, except that the results are treated as in the 
case of the method of paired comparisons. A computational 
short-cut in determining scale values has been prepared by 
Barnhart.*® | 

3. The method of equal-appearing intervals (equal-i 
method)? This is on logical, rational and theoretical grounds 
the most defensible method of measuring attitudes yet de- 
veloped.** The method was first developed by Delboeuf 
for the measurement of sensory discrimination,” but its ap- 

46 Psychometric Methods, (McGraw-Hill, 1936), Chap. 7. See also H. Gurnee 
and E. Baker, “The Social Distance of Some Common Social Relationships,” 
Four. Abnormal and Soc. Psychol., 33:265-69, 1938. This study applies the 
method of paired comparisons to the social distance test. 


4L. W. Ferguson, “The Requirements of an Adequate Attitude Scale,” 


Psychol. Bull., 36:665-73, 1939- 3 4 
48 An excellent discussion i the details of procedùre in this method will 


be found in J. P. Guilford, op. cit., _ 8. yri 
4 E. N. Barnhart, “A Computational Short Cut in Determining Scale Values 


for Ranking Items,” Psychometrika, 4:241-42, 1939- s 
50 For elaboration of this method and detailed methods of computation, 


see J. P. Guilford, Psychometric Methods, Chap. 5. i 
51 For a review of the evidence for this statement, see L, W. Ferguson, op. cit. 


52 Sce E. G. Boring, History of Experimental Psychology (Appleton-Century, 
1929). 
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plication to attitude measurement is primarily the work of 
L. L. Thurstone, whose work in the field is of such dom- 
inance and distinction that scales of this type are quite gen- 
erally, and justly, called “Thurstone scales.” Thurstone's 
own description of the method of their construction can 
probably not be improved upon : 


Several groups of people are asked to write out their opinions 
on the issue in question, and the literature is searched for suitable 
brief statements that may serve the purposes of the scale. By edit- 
ing such material a list of from 100 to 150 statements is prepared 
expressive of attitudes covering as far as possible all gradations 
from one end of the scale to the other. It is sometimes necessary 
to Ee special attention to the neutral statements. If a random 
collection of statements of opinion should fail to produce neutral 
Statements, there is some danger that the scale will break in two 
parts. The whole range of attitudes must be fairly well covered, 
as far as one can tell by preliminary inspection, in order to insure 
that there will be overlapping in the rank orders of different read- 
ers throughout the scale. 

In making the initial list of statements several practical criteria 
are applied in the first editing work. Some of the important cri- 
teria are as follows: (1) the statements should be as brief as pos- 
sible so as not to fatigue the subjects who are asked to read the 
whole list. (2) The statements should be such that they can be 
indorsed or rejecte in accordance with their agreement or dis- 
agreement wit the attitude of the reader. Some statements in a 
random sample will be so phrased that the reader can express no 
definite indorsement or rejection of them. (3) Every statement 
should be such that acceptance or rejection of the statement does 
indicate something regarding the reader's attitude about the issue 
in question. If, for example, the statement is made that war is 
an incentive to inventive genius, the ce or rejection of it 
really does not say anything regarding reader's pacifistic or 
militaristic tendencies. He may the statement as an un- 
questioned fact and simply indorse it as a fact, in which case his 
answer has not revealed anything concerning his own attitude on 
the issue in question. However, only the conspicuous examples 
of this effect should be eliminated by inspection, because an ob- 
jective criterion is available for detecting such statements so that 
their elimination from the scale will automatic, Personal 
judgment should be minimized as far as possible in this type of 
work. (4) Double-barreled statement should be avoided except 
possibly as examples of neutrality when better neutral statements 
do not seem to be readily available. Double-barreled statements 
tend to have a high ambiguity. (5) One must insure that at least 
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a fair majority of the statements really belong on the attitude 
variable that is to be measured. If a small number of irrelevant 
statements should be either intentionally or unintentionally left 
in the series, they will be automatically eliminated by an objective 
criterion, but the criterion will not be successful unless the ma- 
jority of the statements are clearly a part of the stipulated variable. 

When the original list has been edited with these factors in 
mind, there will be perhaps 80 to 100 statements to be actually 
scaled, These statements are then mimeographed on small cards, 
one statement to each card. Two or three hundred subjects are 
asked to arrange the statements in eleven piles ranging from 
opinions most strongly affirmative. to those most strongl negative, 
The detailed instructions will be published with the p pea R 
of the separate scales. The task is essentially to sort out the small 
cards into eleven piles so that they seem to be fairly evenly spaced 
or graded. Only the two ends and the middle pile are labeled. 
The middle pile is indicated for neutral opinions. The reader 
must decide for each statement which of five subjective degrees 
of affirmation or five subjective degrees of negation is implied in 
the statement or whether it is a neutral opinion. 

When such sorting has been Sania. by two or three hun- 
dred readers, a diagram like Figure 2 is prepared. We shall dis- 
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Ficure 2. (From Thurstone.) 


cuss it with the scale for pacifism-militarism as an example. On 
the base line of this diagram are represented the eleven ap- 
parently equal steps of the attitude variable. The neutral inter- 
val is the interval 5 to 6, the most pacifistic interval from oton, 
. and the most militaristic interval from 10 to 11. This diagram is 
fictitious and is drawn to show the principle involved. Curve A 
is drawn to show the manner in which one of the statements 
might be classified by the three hundred readers. It is not classi- 
Ged by anyone below the value of 3, half of the readers classify it 
below the value 6, and all of them classify it below the value g. 
The scale value of the statement is that scale value below which 
just one half of the readers place it. In other words, the scale 
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value assigned to the statement is so chosen that one half of the 
readers consider it more militaristic and one half of them con- 
sider it less militaristic than the scale value assigned. The numer- 
ical calculation of the scale value is similar to the calculation 
of the limen by the phi-gamma hypothesis in psychophysical 
measurement.>3 

The above description does not cover, of course, all the 
details in the construction of a Thurstone scale. For a full 
account the original monograph should be consulted.% 

4. Mixed Methods. As has been indicated in the course 
of the above discussion of various methods, many tests are 
largely the same in both form and method of scoring. In 
some cases a change in the method of scoring a test changes 
its classification from an arbitrary to an experimental scale. 
For example, the simple ranking method becomes an experi- 
mental scale when the statistical methods of paired compari- 
sons are applied to the rankings. Likewise the social dis- 
tance test as usually constructed becomes an equal interval 
scale where the Thurstone method is applied to it. A few 


of the more significant combinations of methods should be ` 


mentioned in concluding this review of current methods. 

a. Likert found that a great number of five-point arbi- 
trary scales of the multiple choice, “statement,” or “inten- 
sity” types yielded distributions resembling a normal curve. 
On this basis he devised the “sigma method” of scoring. 
The percentage of individuals that checked each of the pos- 
sible gradations on a particular statement was converted into 


sigma (standard deviation) values. These values were taken 
from Thorndike's tables ® sho 


bles wing the average values (in 
terms of standard deviations on a normal curve) represented 


by the stated percentages, the origin considered to be at the 
mean." For example, if 13 percent check a certain state- 
ment “strongly approve,” the Thorndike tables show that 
this corresponds to a sigma value of —1.63. Likewise, if 43 


53 For elaboration of this subject methods 
Guilford, Psychometric Methods, McGraw BiN ae a J:n 
4 For full details as to the construction of these tests, oe L. L. Thurstone 


% For a full description of this : 
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percent check “approve,” 21 percent “undecided,” 13 percent 
“disapprove,” and 10 percent “strongly disapprove,” the cor 
responding sigma values are, respectively, —.43, +-48, +-99, 
and +1.76. These sigma values now correspond to 1, 2, 3, 
4 and 5 which are used ordinarily in scoring such series of 
statements. Observe that the sigma values, since they repre- 
sent a percentage of people who have responded to a given 
statement in a certain way, constitute a kind of generalized 
scale in terms of which the responses to a large variety of 
questions can be compared. 

This method has two conspicuous advantages, one of them 
chiefly practical, the other of theoretical importance. As 
compared with the Thurstone technique, the Likert method 
is much less laborious in that it eliminates the sorting of 
statements by a selected panel of judges and substitutes the 
voting or attitude expressions of the people actually taking 
the test. If the Thurstone scales are to be assumed to have 
fairly general validity, it must be assumed that the scale 
values, as arrived at, are independent of the attitude distri- 
bution of the judges who sort the statements, an assumption 
which has been questioned. Using the same statements as 
on a Thurstone scale, Likert’s method has been found to 
yield a high correlation (in the cases tried) with the Thur- 
stone scores, as was perhaps to have been expected. Like- 
wise, the somewhat higher reliability shown by. Likert’s 
method (five possible responses) when applied to a Thurstone 
scale (two responses) is undoubtedly attributable to the demon- 
strated fact that as the number of possible responses increases 
(up to a certain limit) reliability also increases.” A scale 
which Likert constructed independently of Thurstone’s 
method yielded correlations of only .65 and .71 with the 
latter." 

More extensive testing and comparisons of results of these 
methods are needed before final conclusions can be drawn. 
Basing the scale on equal-appearing differences to a random 
sample of the group to be measured rather than to an out- 


$7 E. S. Conklin, “The Scale of Values Method for Studies in Genetic Psy- 


chology,” University of Oregon Publications, 2:1-36, 1923. 
58 L, W. Ferguson, op. cit., p. 671. See also, by the same author, “A Study 
of the Likert Technique of Attitude Scale Construction,” Jour. Soc. Psych., 
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side panel of judges undoubtedly remains an important con- 
sideration in estimating the validity of the scale. All re- 
ported evidence, however, shows that scales constructed by 
the method of equal-appearing intervals, involving judges, 
are remarkably independent of the attitude distribution™ 
of the judges. This is an important fact which has not yet 
been demonstrated for other types of scales. 

b. R. N. Ford® combined some aspects of the Thurstone 
and Likert techniques in scaling statements of degrees of 
personal and community contacts with Negroes. Statements 
of such contacts, Ford found, could be scaled with reliability 
and validity according to the usual standards. While Ford's 
scales are ostensibly “experience” scales rather than attitude 
scales, they may be of great importance in checking our the- 
ories of the origins of attitudes and the relations between 
experience and attitude. Also, many scales of experience 
are likely to reflect attitude, as for example: “What has been 
your policy in the past when a Negro came into a public 
place, such as a theatre, railroad waiting room or restaurant? 


1. Let the Negro know he wasn’t wanted. s 
2. Let him alone so long as he knew his place, 
3. Situation never arose. 


4. Tried to pay no attention to the situation. 
5. Treated him as I did whites in similar circumstances.” % 


Questions of personal contact and experience, when scaled, 
pethaps always tend to reflect attitude as well as mere experi- 
ence. Questions of community contact, however, may be 
purely factual without revealing the responder’s attitude. 


For example : “May a Negro run for election to public office 
in your home area? 


5. Yes. Some hold or have held public office. 
4. Yes. They may try. 
3- Don’t know of any such attempt. 


% Ibid., p. 667. 
conclusively that scale values of items in a test constructed by the 


on groups of judges di i udes r 
ss Ince are iv eo pai a significantly in their attit toward the 

; p Scaling Experience by Multiple-Response Technique: A 
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2. Legally, yes, but they never do. 
1. Whites simply would not allow it.” 62 


Ford had the group of five responses for each item sorted 
by seven judges according to the degree of friendship or hos- 
tility indicated. By comparisons of the ranking of the judges, 
ambiguous „items were discovered and virtual unanimity of 
judgment was secured for the statements finally used. Ford’s 
method is essentially a combination of the methods of Likert 
and Thurstone. 

c. S. C. Dodd combined the Thurstone technique with the 
social distance test.°* 


In selecting statements for a social distance test the customary 
subjective judgment of the constructor of the test was replaced b 
the three criteria of (a) equal attitudinal distances,®* (b) minima 
ambiguity, and (c) maximal agreement between panels of judges. 
Thirty-nine statements, ranging from friendly to hostile, were 
rated on an eleven-point scale by sixty judges grouped in two 
panels. The resulting mean rated position of each statement was 
graphed, enabling the selecting of statements at equal intervals. 
The standard deviation (SD) of the sixty ratings of a statement 
measured its ambiguity. The difference® between the mean 
position assigned to a statement by the panel of students from all 
the Near East and by the panel of business men of Beirut is a 
measure of the degree to which the judges’ rankings might de- 
pend on occupation, nationality, religion, age, and other factors 
which differed between the two panels of judges. A correlation 
of .99 was found between the thirty-nine positions assigned by 
the two panels. The five statements with the smallest differ- 
ences between panels were then determined from the candidates 
remaining after applying criteria (a) and (b). The statements 
were assigned scores from o to 100 to serve as units of social dis- 
tance expressed in percentages of the maximal distance, 5o being 
the neutrality point between friendliness and hostility. 


The application of these tests to the conglomeration of re- 
ligious, educational, national and economic groups in the 
Near East, and Dodd's analysis of the results is a fascinating 


63 Ibid., p. 14. 

63"A Social Distance Test in the Near East,” Amer. Jour. Sociol., 41:194-204, 
1935. The method of paired compari was applied to the social distance 
test by H. Gurnee and Elizabeth ker, “The Social Distances of Some Com- 
mon Social Relationships,” Jour. Abnormal and Soc. Psychol., 83:255-69, 1938. 
ĉi See Thurstone and Chave, The Measurement of Attitudes (Univ. of Chi- 


cago Press, 1929). , 
$S Expressed in units of the standard deviation of the difference. $ 
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DATA ON STATEMENTS ACCEPTED 


A. If I wanted to marry, I would marry 
one of them. 


B. I would be willing to have as a guest 
for a meal. 


C. I prefer to have merely as an ac- 
quaintance to whom one talks on 
meeting in the street. 


D. I do not enjoy the companionship of 
these people. 


E. I wish someone would kill all these 
individuals. 


F. I know nothing about this group; I 
cannot express an attitude. 


* Omitted from all calculations, 3 

The test comprised four subtests adapted to the educational, economic, 
national, and religious groupings in the Near East, A contribution of this 
study was intended to be the measurement of social distances between such 
a diversity of groups by means of a single scale and type of unit. 


example of this type of sociological analysis which can be 
made through attitude measurement. 


I, ESTIMATING THE VALIDITY OF THE TESTS 


As in the case of schedules and questionnaires in general, 
tests of the kind illustrated above need to be pretested and 
revised a large number of times before they are used for 
research purposes. Also, it is desirable to attempt to deter- 
mine how closely the verdict of the tests agrees with other 
accepted criteria of existing attitudes. We shall return in 
the next chapter to a more thorough consideration of the 
value of social measuring instruments in general. ‘The fol- 
lowing general tests of validity suggest the type of checking 
to which attitude tests are subject. 

1. The first and most subjective test of the validity of any 
device designed to measure attitudes lies in its logic and rea- 
sonableness with reference to tested existing knowledge. All 


MEASUREMENT OF ATTITUDES 243 


tests should, therefore, first be considered from the stand- 
point of how the principles on which they are constructed 
square with the established principles of science. 

2. Where a number of tests are available, the correlation 
of the results of a particular test with the results of all of the 
other tests or of a group of them gives some presumptive 
basis for judging the particular test. 

3. A third method of estimating the validity of a test is 
to secure a consensus of judgment from a group as to what 
individuals among their close personal friends hold pro- 
nounced attitudes on certain questions. The application of 
the test to these individuals then gives a measure of the de- 
gree to which the test agrees in its findings with the judg- 
ments of close associates. Individuals known for their objec- 
tivity of mind may also be confronted with their own scores 
on the tests and the degree of their acceptance of the results 
may be some indication of the validity of the test. 

4. Certain groups may, in the judgment of competent per- 
sons, be expected to have certain biases. The ability of the 
tests to reveal these attitudes is some indication of the valid- 
ity of the tests. 

5. The degree to which intelligence, education, and famil- 
iarity with the language of a subject, rather than his atti- 
tudes, tends to be measured by the test may be determined 
by correlation of attitude test scores with scores of intelli- 
gence, reading tests, and other factors that are suspected of 
being influential. € 

6. In addition to these general tests of the validity of the 
tests given above, Thurstone suggests several objective meas- 
ures of the validity of the attitude scale after it has been con- 
structed.®® 

(a) The scale must transcend the group measured. — A measur- 
ing instrument must not be seriously affected in its measuring 
function by the object of measurement. . - If a yardstick meas- 
ured differently because of the fact that it was a rug, a picture, or 
a piece of paper that was being measured, then to that extent the 
trustworthiness of that yardstick as a measuring device would be 
impaired. .. The assumption is.. . that two statements on a 
prohibition scale will be as easy or as difficult to discriminate for 
people who are “wet” as for those who are “dry”... 


é L. L. Thurstone, op. cit., pp. 547-52 
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The experimental test for this assumption consists merely in 
constructing two scales for the same issue with the same set of 
statements. One of these scales will be constructed on the returns 
from several hundred readers of militaristic sympathies and the 
other scale will be constructed with the same statement on the 
returns from several hundred pacifists. If the scale values of 
the statement are practically the same in the two scales, then the 
validity of the method will be pretty well established.°? It will 
still be necessary to use opinion scales with some discretion. 
Queer results might be obtained with the prohibition scale, for 
example, if it were presented in a country in which prohibition is 
not an issue. 

(b) An objective criterion of ambiguity.—Inspection of the 
curves in Figure 2 reveals that some of the statements of the 
fictitious diagram are more ambiguous than others. The degree 
of ambiguity in a statement is immediately apparent, and in fact 
it can be definitely measured. The ambiguity of a statement is 
the standard deviation of the best fitting phi-gamma curve 
through the observed proportions. The steeper the curve, the 
smaller is the range of the scale over which it was classified by 
the readers and the clearer and more precise is the statement. 
The more gentle the slope of the curve, the more ambiguous is 
the statement. Thus of the two statements A and B in the 
fictitious diagram the statement A is the more ambiguous. 

In case it should be found that the phi-gamma function does 
not well describe the curves of proportions in Figure 2, the de- 
gree of ambiguity may be measured without siscitacing that the 
proportions follow the phi-gamma function when plotted on the 
attitude scale. A simple method of measuring ambiguity would 
then be to determine the scale distance between the scale value 
at which the curve of proportions has an ordinate of .25 and the 
scale value at which the same curve has an ordinate of .75. The 
scale value of the statement itself can also be defined, without 
assuming the phi-gamma function, as that scale value at which 
the curve of proportions reaches ‘50. If no actual proportion is 
found at that value, the scale value of the statement may be inter- 


polated between the experimental a re 
and below the .50 treks proportions immediately above 


(c) An objective criterion of irrelevance.—In addition to 
these tests of validity, Thurstone gives a “criterion of irrele- 


vance” of the statements used in the scale, which may be 
summarized as follows: 


“The neutrality point would not necessarily be 


same 

oo t mp both erp onr an pacifists, ba i scale ana Bacwers 
pairs of statements be practica same i 

of standardization.” L. L, Tharsone, oboe er dag ss: 
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Assume that the whole list of a hundred statements on 
the subject of militarism-pacifism has been submitted to sev- 
eral hundred readers for their indorsement or rejection. Let 
these readers be asked to mark with a plus sign every state- 
ment which they endorse and with a minus sign every state- 
ment they reject. All of the statements have, it must be re- 
membered, already been rated on the basis of the degree of 
pacifism they express and have been assigned a value in a 
frequency scale on the basis of the previous ratings. Ifa state- 
ment is relevant for our purpose, i.e., if it is accepted or re- 
jected primarily because of the attitude on pacifism which it 
portrays, then the rating of the other statements should ar- 
range themselves with reference to it in a more or less normal 
frequency distribution. If on the other hand, the statement 
under consideration has implications which lead to accept- 
ance or rejection for other reasons than the degree of paci- 
fism which it conveys, i.e., if it is irrelevant for our purpose, 
then the ratings of the other statements with reference to the 
basic statement will not fall in the form of a normal fre- 
quency distribution. 

For example, assume that the hundred statements have 
been arranged as a frequency distribution in a scale from o 
to 8 on the basis of the degree of pacifism they represent in 
the opinion of a large number of readers. Assume that we 
desire to test the relevance of a statement which falls at point 
4.0 in the scale. If the readers indorse or reject the state- 
ment chiefly on the basis of the degreé of pacifism it repre- 
sents, then these readers who indorse statements in the vi- 
cinity of 4.0 on the scale will not often indorse statements 
that are very far away from that point on the scale. Very 
few of them should indorse a statement scaled at point 8.0, 
for example. “If a large proportion who indorsed the state- 
ment scaled at 4.0 should also indorse a statement scaled at 
8.0, then we should infer that their voting on these two state- 
ments had been influenced by factors other than the degree 
of pacifism that is implied in the statements.” 

We shall consider further in the next chapter certain basic 
questions of the validity of sociometric scales in general. In 
that connection we shall consider also the query: “What do 


sociological scales measure?” 
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J. SUMMARY AND CONCLUSION 
- 


We have described and illustrated above the principal 
methods currently used in attempting to measure a kind of 
human behavior. That behavior is called attitudinal be- 
havior and was characterized in the opening sections of the 
present chapter. The strength and weakness of different 
methods has been noted in the course of the discussion. In 
conclusion, some considerations regarding attitude measure- 
ment in general should be mentioned." 

1. In the first place, the above examples of opinion ques- 
tionnaires and tests in common use do not, of course, pre- 
tend to represent an exhaustive or complete list of such de- 
vices. Each general type has numerous variations to suit 
particular subjects or purposes. + In addition to these rela- 
tively formal methods, we pointed out in our classification 
‘of methods (pp. 220-21) that illuminating studies of attitude 

. have also been made through participant observer techniques, 

interviews, studies of voting behavior, and the analysis of 
life history documents. These will be considered in a later 
chapter. 
_ The ‘examples exhibited in this chapter are submitted as 
illustrative of the more formal and direct devices at present 
employed to determine certain stimulus-response patterns 
which exist in social groups. The behavior or intercom- 
munication constituting these action patterns is the subject- 
matter of social psychology. As we have seen, this inter- 
communication is cartied on chiefly by means of symbols — 
principally by language symbols. Hence we study these 
action-patterns — the attitudes — through the observation and 
interpretation of language responses, not only as verbal be- 
havior, but as indications of more complete adjustments. 


"Attention is again directed to two excellent summaries of the present 
status of attitude measurement by L, W. Pwhich 
em anys: by A Ferguson and Daniel Day, to whi 
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2. It may be that a person is largely unable to analyze all 
aspects of his attitudes and opinions. But the analysis he 
does make is significant, as psychiatrists and others well know. 
At least his expressed attitude shows what he wants people 
to think he thinks. “I conclude...” says Cooley, “that the 
imaginations which people have of one another are the solid 
facts of society, and that to observe and. interpret these must 
be the chief aim of sociology.” ™ 

We have emphasized, however, that the degree of corre- 
spondence between verbal behavior and other behavior is 
not of itself a conclusive criterion of the validity of the re- 
sults secured through an attitude scale. To be sure, such 
correspondence is of interest and great practical significance. 
A great deal of the interest in attitude studies undoubtedly 
arises from a desire to predict also non-verbal behavior. But 
this should not cause us to overlook the fact that attitudinal 
behavior has its own significance. To be able to predict atti- 
tudinal behavior may be quite as significant as any other kind 
of prediction. The relationship of attitudinal to other be- 
havior is, in fact, a separate problem. 

3. It must always be remembered that responses to ques- 
tionnaires and tests are speech responses to a total situation, 
and not only a response to the words of the questionnaire or 
to the hypothetical situation they represent. All the condi- 
tions under which the questionnaire is given are part of the 
situation. If the questionnaire calls for attitudes on sub- 
jects governed by strong mores, the’subject’s expressed atti- 
tude will tend to be the conyentional attitude, especially if 
his name is required on the schedule.” That is, the subject 
will express what he thinks his attitude ought to be rather 
than what his behavior in the actual situation might be. As 
a verbal response his expressed attitude may still be of value 
and interest. But too many studies of attitudes assume that 
the verbal response must be a reliable index of other be- 
havior. Discrepancies between the two can be partly elim- 
inated by the form of the schedule, anonymity of the replies, 

71C. H. Cooley, Human Nature and the Social Order, (Scribners, 1902), 
Perie S. M. Corey, “Signed Versus Unsigned Attitude Questionnaires,” 


Jour. Educ. Psychol., 28:144-48, 1937- This author suggests on the basis of an 
empirical study that the significance of anonymity has been exaggerated. 
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and careful standardization of the conditions under which 
it is presented. If we knew enough about those conditions 
which prevent people from revealing their “true” attitudes 
in a given situation, we could measure the influence of dis- 
turbing factors and allow for them in our interpretation as 
does the physical scientist in analyzing and interpreting a 
situation outside the control of his laboratory. A correlation 
of “experience” or “situational” test results, not to mention 
everyday behavior, with the results of opinion and belief 
scales, is precisely what is needed in this respect. We have 
already referred to LaPiere’s studies of this subject.*® Where 
possible this adds greatly to the accurate interpretation of 
the results. 

4. Full allowance must be made in the interpretation of 
attitude statements for the differences in language equip- 
ment, education, and general cultural background on the 

‘part of the subjects. Ambiguity of words and misunder- 
standings and misinterpretations on the part of the subjects 
may invalidate the replies as expressions of attitudes. These 
-replies are, furthermore, subject to all the rationalizations 
and fantasies that play so large a part in thinking and in 
speech reactions. Antagonisms and inhibitions aroused by 
a question may prevent a frank expression of attitude. 

5. The objection has been raised that the methods of meas- 
uring attitudes reviewed above are too specific in content to 
measure phenomena like attitudes, allegedly more general in 
character. Kirkpatrick and Remmers% have accordingly 
constructed tests designed to allow for the more adequate ex- 
pression of general attitudes, ere seem to be no grounds for 
cana that attitudes of all degrees of generality exist, and 

at they are subject to measurement by essentially the same 

See R. T. LaPiere, op. cit. 
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types of techniques. Practically, therefore, this question re- 
duces itself largely to what type of attitude interests us, and 
the types of items appropriate to the end sought. 

From the theoretical point of view, it is well to remember 
in this connection that no measurement of any phenomenon 
necessarily describes it completely. No scientific instru- 
ment pretends to measure all aspects or dimensions of the 
“objects” designated by folk language. It selects out only 
certain revelant aspects of an object or of behavior, and 
measures it. Now, one aspect of attitudes—the aspect we 
are interested in for our present purpose —is the “more or 
less” aspect of opinions relating to some subject. This is an 
aspect of people’s opinions which we do measure empirically 
in our everyday contacts, when we say, for example, that one 
person is more radical or more conservative than another 
on some question, or that one person is more in favor of a 
specific proposal than another. All that the technology dis-* 
cussed in this chapter proposes to do is to make this pro- 
cedure more objective and accurate. 

6. Finally, the complete cultural relativity of all attitude 
measurements should be recognized. As Thurstone has 
pointed out, significant results could hardly be expected 
from a scale designed to measure attitudes toward prohibi- 
tion in a country where prohibition is not an issue. Simi- 
larly, a scale to measure attitudes toward labor unions would 
obviously not yield significant results if applied to a non- 
industrial society. The possible influence of difference in 
cultural background must also be considered in much less 
extreme instances as, for example, when college people devise 
scales to be used on non-college populations. It is true that 
Thurstone scales have been found to be remarkably inde- 
pendent of the attitudes of the judges sorting the statements. 
But this is a subject which must be considered whenever at- 
titude scales arejused. It is not, however, a handicap unique 
to attitude scales, The readings of most physical measuring 
instruments must also be adjusted if these instruments are 
used under conditions varying widely from those for which 

76 This subject is also related to the questions of “what” attitude tests 


measure, the “nature” of “real” attitudes, etc. See Chap. IX, pp. 302-06 
below, Also, G. A. Lundberg, Foundations of Sociology, pp. 61-65. 
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the instrument was constructed. An ordinary thermometer 
will give reliable readings only between certain temperatures 
and the readings of other instruments will vary, for example, 
with altitude. Techniques of detecting and allowing for 
such variations are, of course, an important aspect of research 
in attitude measurement. 

7. Finally, there remain some technical and theoretical 
problems regarding the “nature” of the units used in attitude 
scales, and the consequent permissibility of certain mathe- 
matical operations with the scores secured." These issues 
do not greatly affect the validity or demonstrated practical 
usefulness of the type of measurements discussed in this chap- 
ter as instruments of discrimination, but they are questions _ 
of theoretical and scientific significance that can be answered 
only by further research. The last decade has seen an enor- 
mous increase and improvement in attitude measurement, 
with a pronounced trend in favor of the equal-interval scales.8 


1 For a brief statement of these alle limitations of equal-interval scales, 
see H. M. Johnson, “Some Neglected Principles in Aptitude-Testing,” Amer. 
Jour. Psyc ol „47159-65, 1935; “Pseudo-Mathematics in the Mental and 
Social Sciences,” Ibid., 48:342-51, 1936; C. Kirkpatrick, “Assumptions and 
Methods in Attitude Measurements,” Amer, Sociol. Rev., 1:75-88, 1936; 

in Ethnic Opinionnaires,” Amer. 
Sociol. Rev., 5:18-28, 1940. For comments on Merton's’ paper, see remarks 
by G. A. Lundberg in the same journal, pp. Fag , and by Jessie Bernard, 
same journal, pp. 415-17, together with a re; E Merton, pp: 647-48. A 
fuller statement of the present author's position will be found in G. A. 
Lund + Foundations of Sociology, (Macmillan, 1939), Chap. II. A some- 
what di t view will be found in M. R. Cohen and E. Nagel, An Intro- 
duction to Logic and Scientific Method, (Harcourt, 1934), Chap. 15. 

Dh Day, (op. cit., p. 400) gives the following summary : 
TABLE: IV 


Comparison by number and percentage of attitude studies published in the periods 1925- 
employed 


1931 and 1997-1939 classified by methods 
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There is no question that great progress has been made to- 
ward a more refined technique of measuring a type of phe- 
nomenon with which the social sciences must be concerned 
to a large degree. There is great demand for reliable ways 
of gauging attitudes—for more accurate means of determin- 
ing and predicting the social weather. Leaders and propo- J 
nents of social programs of every kind always have been and 
always must be concerned with the attitudes of the group 
they want to influence. The technique thus far has been 
chiefly of the intuitive, rule-of-thumb type. An improve- 
ment is all that is claimed for the methods considered in 
this chapter. The future of this type of research must de- 
pend on still further improvements in these methods. 


K. SUGGESTIONS FOR FURTHER STUDY 


1. By far the best brief digest of the recent literature on 
the measurement of attitudes is Daniel Day, “Methods in 
Attitude Research,” Amer. Sociol. Rev. 5:395-410, 1940. 
This excellent article contains an annotated bibliography of 
200 items covering the literature which appeared in readily 
available books and periodicals during the years 1937-39 
inclusive. An even more pointed and critical discussion of 
the subject is L. W. Ferguson, “The Requirements of an 
Adequate Attitude Scale,” Psychol. Bull., 36:665-73, 1939. 
The status of different types of scales from the point of view 
of the specified requirements, with a supporting bibliography 
of 45 items, is incisively discussed. For a more general He 
cussion, see D. E. Scates, “The Essential Conditions of Mea 
urement,” Psychometrika, 2:27-34, 19373 and by the same 
author, “How Science Measures,” Jour. Exper. Ed., 5:296- 
$12, 1937- - 

2. A somewhat longer discussion of methods, as well as a 
digest of studies and results, is Chap. 1 in Experimental So- 
cial Psychology, by G. and L. Murphy and T. Newcomb (Re- 
vised Edition, Harper, 1937). See especially the summary 
tables, pp. 948-51, 956-59, 982-85, 998-99- 

3. The best general source on all types of scale construc- 
tion, together with detailed methods of computing scale 
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values and scores, is J. P. Guilford, Psychometric Methods 
(McGraw-Hill, 1936). For detailed accounts of the tech- 
niques of construction of particular scales, see L. L. Thurs- 
tone and E. J. Chave, The Measurement of Attitude (Uni- 
versity of Chicago Press, 1929); G. Murphy and R. Likert, 
Public Opinion and the Individual, (Harper, 1938), Chap. 3 
and Appendix B; C. Kirkpatrick, “The Construction of a 
Belief-Pattern Scale for Measuring Attitudes Toward Fem- 
inism,” Four. Soc. Psychol., 6:421-37, 1936; E. A. Rundquist 
and R. F. a ik Personality in the Depression (University of 
Minnesota’ Press, 1936); A. C. Rosander, “An Attitude Scale 
Based upon Behavior Situations,” Four. Soc. Psychol., 8:3-15, 
1937. See also, T. C. McCormick and R. C. Schmid, “A 
System of Attitude Experiments,” Soc. Forces, 19:351-56, 
1941. For a set of scales designed to measure to what extent 
motion picture plays violate the mores, see C. C. Peters, Mo- 
tion Pictures and Standards of Morality, (Macmillan, 1933). 
See also, R. F. Sletto, Construction of Personality Scales by 
the Method of Internal Consistency, (The Sociological Press, 
Hanover, N. H., 1937). 

4. References to methods of measuring a great variety of 
aspects of personality will be found in G. W. Allport, Per- 
sonality, A Psychological Interpretation, (Holt 1937). A 
good general treatise is E. L., Thorndike, The Psychology of 
Wants, Interests, and Attitudes (D. Appleton-Century, 1935): 

5. A Bibliography of Mental Tests, by Gertrude Hildreth 
(The Psychological Corporation, New York, 1939), contains 
129 feferences to attitude tests. 

6. For an important contribution toward a resolution of 
some controversial points regarding the types of scales that 
have been discussed in this chapter, see F. $. Chapin, “The 
Syntactical Analysis of Sociometric Techniques: Cases in 
Point.” Sociometry, 4:177-83, 1941. 

7 Ward H. Goodenough, ”A Technique for Scale Analy- 
sis’; Educ. and Psych. Meas., 4:3, pp- 179-190, Autumn, 
1944. 

8. Louis Guttman, “A Basis for Scaling Qualitative Data”; 
Amer, Sociol. Rev. 2, PP- 139-150, April 1944. 

9. Herbert S. Conrad, “Statistical Methods Related to 
Test Construction and Evaluation”; Review of Educ. Re- 
search, 14:110-126, Feb, 1944. 


CHAPTER IX : 


THE MEASUREMENT OF INSTITUTIONAL 
BEHAVIOR 


. 


_For practical reasons even more than for purely intellectual ones, we need 
rigorously scientific studies of human society and of our individual relations 
to it. In particular we need such studies of the societal interests that are 
labeled “public policy,” “education,” “missions,” and “social work.” I am 
aware that this proposition is resented by men and women who suffer from 
an anti-“academic” complex and ig lest the “human touch,” “the ways of 
the neighborhood” and the naive thinking of “plain le” shall have 
spontaneity squeezed out of them by theory. This is an unfortunate mis- 
apprehension of what science is and of what it does for us. It ought not to 
be necessary at this late day, but it is necessary, to tell the general public 
that science is nothing more nor less than getting at facts, and trying to 
understand them, and what science does for us is nothing more nor less t 
helping us to face the facts. — F. H. Giddings. 

THE DEMAND FOR MEASUREMENTS OF INSTITUTIONAL 


BEHAVIOR 


The appraisal of institutional behavior with reference to 
norms, either for comparison with other communities or with 
respect to ideals or ends toward which communities strive, 
has always been a matter of interest. In a small and homo- 
geneous society of close personal contacts, the subjective, 
empirical, evaluative judgments of everyday life were ac- 
cepted as adequate. But with the development of a large 
and heterogeneous society, there has come an increasing de- 
mand for evaluations and measures which shall be imper- 
sonal and shall have the general validity and objectivity of 
measures in the physical sciences, in business, and in indus- 
try.» In education and psychology complicated intelligence 
and achievement tests have been developed which enable us 
to compare pupils as individuals and as groups with each 
other, with pupils in other schools, and with pupils in other 
cities, states, and countries. As long as our interest in the 
problem was confined to the one-room school with a dozen 
children, the teacher’s subjective rating of the pupils served 
all the purposes for which ratings were desired. But today 
school executives demand ratings in terms that will enable 
comparisons with standards and norms far beyond the ex- 

1The Scientific Study of Human Society, (University of North Carolina 
Press, 1924). 
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perience of the individual teacher. On the basis of such 
comparisons they draw conclusions regarding the efficiency, 
needs, and problems of different schools. Colleges and uni- 
versities demand an evaluation of prospective students in 
standard terms. These ratings are taken into serious con- 
sidération in determining the disposal of individuals, whether 
as delinquents or as candidates for honors, Standard meas- 
ures of “intelligence” and school achievement are becoming 
items of information in social diagnosis in a way comparable 
to such items,as height and weight in a physical examination. 

Obviously, the same needs which have given rise to these 
measures of the individual are also present when we con- 
sider groups and institutions. If we admit the importance 
of home environment in cases of juvenile delinquency, then 
an intelligent diagnosis of a case demands an index of such 
environment for the same general reason that we desire an 
intelligence quotient. Both aid us in accounting for the de- 
linquency, and indicate the degree to which each factor may 
be responsible. Likewise a business man who is accustomed 
to insist on a rigid weighing and measuring in objective 
terms of money expended and returns received is increas- 
ingly inclined to demand an equally tangible accounting 
from the charitable, educational, or governmental institu- 
tions he is supporting. 

We are concerned in the present chapter with methods of 
grading institutions, organizations, and communities. The 
types of such institutions and organizations are legion, and 
eachitype demands a different kind of yardstick, but the gen- 
eral principles for the construction of such yardsticks are the 
same for all. Wershall consider, therefore, (1) the general 
requisites and desirable attriljutes of a valid sociometric 
scale ; (2) the concrete procedure in the development of such 
a scale, including the difficulties encountered in its construc- 
tion. After considering the theory and principles of con- 
structing sociometric scales, we shall conclude with a con- 
sideration of several concrete examples of them. 


B. GENERAL REQUISITES OF A SOCIOMETRIC SCALE 


The five principal general requisites of a desirable socio- 
metric scale are as follows : 
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1. It must be generally applicable, reliable, and valid. A 
measuring instrument is considered reliable when it gives 
the same or highly similar results when repeatedly applied 
to the same or highly similar phenomena. A scale is called 
valid when it measures in some demonstrable way what it 
claims to measure. We have touched on these requirements 
in the preceding chapter and shall return to them again in 
a later section. By the general applicability of a scale we 
mean that it must remain reliable and valid when applied 
to any case within the range or class of phenomena for which 
the instrument was designed. How broad that cla8s is to be 
depends entirely upon our purpose in constructing the scale. 
We may construct a scale measuring socioeconomic status of 
coal miners in Pennsylvania or of the entire population of 
the United States or even a more heterogeneous population. 
It is true that as a science matures it tends to develop some 
measures of extremely general validity, such as measures of 
length and weight. Even mature sciences, however, are con- 
stantly constructing special instruments to measure highly 
selected phenomena. These instruments sometimes have 
limited applicability. 

How general an instrument should be in its application 
is a question to be decided with reference to the problem we 
wish to solve. The more generally applicable a scale can be 
made without impairing its reliability and validity, the bet- 
ter. Practically, there are usually limits beyond which it is 
not desirable to extend the applicability of any particular 
scale. If we strive for too great generality we may seriously 
impair the scale’s validity as an intensive measure of a more 
restricted type or class of objects. For example, if we at- 
tempt to make a measuring scale for homes so general in its 
application that it will measure with validity the homes of 
Chinese coolies or of Australian aborigines as well as modern 
American homes, the requirements of such a scale might call 
for the inclusion of many items which would be necessary 
for the effective rating of one class of homes, but which 
would be totally irrelevant for another class. This-might in 
turn seriously impair the value of the scale as an intensive 
measure of any of the classes. Likewise, the American Pub- 
lic Health Association, in its Appraisal Form for Public 
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Health Work which will be discussed later, found that a 
scale for rating the health departments of cities of 50,00¢ 
population, or over, would not be a fair measure of the 
health departments of smaller cities in many respects. For 
example, while the employment of a full-time health officer 
may be a reasonable point on which to score a city of over 
50,000, this item would not necessarily be significant as a 
basis for grading in a smaller city, because (a) none of the 
smaller cities may have such provision, and (b) the presence 
or absence of a full-time official, in a small city, may not be 
an important index of the efficiency of health work in the 
city. While a scale should be generally applicable to the 
range of objects for which it is intended, we should avoid 
classifying together objects which have too little in common. 
It is better to limit the range of classification and to estab- 
lish norms for each particular class, 

2. It must allow for and make possible the establishment 
of reasonable norms relative to which the phenomena meas- 
ured can be compared. This is largely a corollary of what 
has been said above regarding the classification of the ob- 
jects or aspects to be measured. If we attempt to apply a 
scale to too wide a range of objects, the norms for some mem- 
bers of the class will be Utopian and largely worthless for 
Practical purposes. One way to arrive at a norm is to take 
the statistical mode of a representative sample of the group 
to be measured. When it is not possible to establish a norm 
in this way, standards may be tentatively set by the combined 
judgments of experts or others regarded as best qualified to 
pass on the particular point under consideration. Methods 
of selecting such experts or authorities so that the results will 
command respect and obviate ‘Personal biases as far as pos- 
sible, will be considered in a later section. 

3. A third requisite of a good scale is that the factors 
which go into its construction must be generally available. 
It is frequently true in the present Stage of development of 
the social sciences and with the present methods of record 
keeping in public institutions that many factors of great sig- 
nificance in determining the efficiency of an institution are 
not at present definable in terms sufficiently objective to per- 
mit comparison with corresponding factors elsewhere. In 
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such cases we cannot include such factors in a measuring 
scale, regardless of how important the factors may be. This 
is a difficulty which militates against the refinement and 
validity of all sociometric scales at the present time. We 
have not yet developed techniques for measuring many sig- 
nificant types of behavior. The development of such tech- 
niques is, as we have seen in previous chapters, the central 
problem of the social sciences. Until science makes avail- 
able more objective definitions and measures of the signifi- 
cant factors in institutions we must utilize the factors and 
measures generally available. It avails us nothing in the con- 
struction of a practical measuring scale to require informa- 
tion in cur scale which is not available in the great majority 
of cases which we propose to measure, regardless of how de- 
sirable that information, may be. We must confine our 
analysis to factors available, although the resulting inade- 
quacy will to some degree invalidate our scale as a reliable 
measuring device. 

4. Within the limitations set forth above, a fourth requi- 
site of a sociometric scale is that it should take into consid- 
eration all significant aspects and factors for which data can 
be secured. Here again we are confronted with the problem 
of defining significance, and of deciding which factors are of 
sufficient significance for our purpose to justify their inclu- 
sion in our scale. This problem involves considerations of 
units and other matters that will be discussed later. In the 
ery of means of statistical or other objective demonstra- 

on of the role played by each factor in the total function- 
ing of the institution to be studied, we must again rely for 
tentative evaluation on the next best criterion, namely, on 
the combined judgment of agroup of persons best qualified 
to judge. In the end these evaluations are tested by what- 
ever criteria we finally accept as indicative of the validity of 
the scale as a whole, which we shall discuss later in this chap- 
ter. 

We must have not only a selection of significant factors to 
include in our scale, but an evaluation of the degree of their 
significance. This leads us to the final requirement of a 
satisfactory scale. 

5. Each factor included in the scale must be weighted ac- 
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cording to its relative significance. If we enumerate a dozen 
factors entering into the functioning of an institution, some 
will be of greater significance than others. A reliable scale 
must allow for this fact. The problem is largely the same 
as that involved in the original selection of the factors to be 
included. Both will be determined in the last analysis by 

e purpose of the scale, i.e., what is to be measured or 
solved. In the absence of objective demonstration of the 
relative significance of different factors, we must again rely 
tentatively on the combined opinion of persons best quali- 
fied to judge. This is frankly a concession to necessity in 
the absence of scientifically developed norms, With the de- 
velopment of scientific study of social institutions there is 
no reason why even ethical judgments should not be based 
on relatively objective and verifiable data. In the mean- 
time we are compelled to rely tentatively on our nearest ap- 
proximation to scientific evaluation, namely, the combined 
judgment of persons in closest touch with the situation and 
whose habits of objective thought are generally recognized. 
j The application of these principles in specific cases will be 
illustrated below. With these general principles before us, 
we may-now proceed to a consideration of the concrete pro- 
cedure in the construction of a scale, including a considera- 


Pe of the difficulties involved and the ways of meeting 
em. 


C. PROBLEMS IN THE CONSTRUCTION OF A SOCIOMETRIC SCALE 


The first step in the construction of a sociometric scale is 
the selection of the behavior, institution, organization, or 
aroan, of earn are to be measured» The unit 

easure vary from some prima: oups such as 
the family or neighborhood to the peat n Leta and de- 
rived groups, such as an industrial or business organization, 
the educational system as a whole or the public libraries 
alone, a department of the city government, the whole gov- 
ernment, or the city as a whole. In addition to the selection 
of the unit to be measured, we must also decide what aspect 
of the unit we desire to measure. We might develop a scale 
designed to rate cities from the standpoint of their relative 
advantages for general industrial development; or for the 
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development of a particular industry; or as health centers. 
Again, we may design our scale to measure cities from a much 
broader viewpoint such as their general desirability for peo- 
ple of specified tastes, as a place in which to live. Other units 
are subject to the same variety of rating systems. 

It is well to bear in mind that all measurement, whether 
of the solar system or of a grain of sand, is specific in that thë 
measuring instrument must specify what it takes into con- 
sideration. No measurement describes all aspects of any nat- 
ural phenomenon. To measure a tree usually means to 
measure its height. The frequency with which the term meas- 
urement in this case is used to designate the measurement of 
that particular aspect sometimes causes us to assume that in 
the other sciences “objects” “as a whole” are measured, and 
that in such cases the problem of what aspect to measure does 
not arise. But the situation is clearly not different in this 
respect from what confronts us in the measurement of social 
phenomena. To be sure, it it possible to combine measure- 
ments of many aspects into composite indices. But such in- 
dices, while “general” in their inclusiveness, are as specific 
in denotation as are their constituent factors. 

Sociometric scales aim, of course, to. discriminate accu- 
rately between different gradations of highly intricate com- 
plexes of social behavior. The scores assigned to the dif- 
ferent cases measured may also serve as indices of a number 
of other social conditions with which the aspects measured 
happen to be reliably associated. The discovery of such cor- 
relations must, indeed, be a major task of the social sciences. 
The physician is able to tell a great deal, and to predict 
about a patient, from readings of a few simple instruments 
only because of well established correl tions of this kind. 
Scales of the type here under discussion facilitate the estab- 
lishment of such correlations and herein lies their larger 
scientific significance. 

Two principal problems in the construction of sociometric 
scales, then, face us at the outset: (1) How shall we select 
the aspects or factors of a unit which we deem significant and 
which are therefore to be considered in our scale? (2) How 
shall we determine the relative weight to attach to each fac- 
tor included ? 
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D. THE FACTORS AND WEIGHTS IN A SOGIOMETRIC SCALE 


The question as to what factors are significant in a situa- 
tion is, of course, a recurrence of the problem which arises 
at the beginning of every investigation and which we con- 
"ae in a previous chapter in our discussion of the work- 

ng hypothesis. In all situations which have not been scien- 
tifically analyzed into the factors which can be demonstrated 
to operate in certain sequences, we assume such factors and 
sequences from intuition, analogy, logical deduction, or em- 
pirical observation. But in every situation, there are some 
people who are better qualified than others to select the sig- 
nificant factors in a specific situation. Such superior qualifi- 
cations spring from closer contact with the facts in the case 
and training in logic and objectivity of mind. These are 
obviously the persons to make the original selection of the 
significant factors to include in a sociometric scale. We 
shall deal in a later section with methods of verifying these 
judgments. 

' The selection of a limited number of the best qualified 
persons for a particular purpose, however, is itself a prob- 
lem. In'the case of the sociometric scale we need two types 
-of such persons to correspond tothe two main problems 
mentioned above : (1) General judges to select the significant 
categories, or factors in a situation and to assign weights to 
them according to their relative significance. (2) Special 
judges to determine the criteria of efficiency of the function- 
ing of each special factor selected by the general judges. 
The first class calls for persons of broad experience rather 
than of intensive ee persons who can see social 
Institutions and p Ocesses as a whole and in proper perspec- 
tive, and who can weigh the many interrelated elements with 
a high degree of objectivity. A group of representatives of 
the different social sciences would perhaps be the best quali- 
fied for this task. The second class of judges calls for spe- 
cialists in each field selected as a factor in the unit to be 
measured — the physician, the banker, the engineer. 

For example, if it is desired to construct a scale for meas- 
uring the relative desirability of cities as good places in which 
to live for people of conventional tastes, the first task is to 
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select the general departments or factors in a city which are 
considered of major significance from this point of view.? 
Who shall make this selection and determine the relative 
weight to be attached to éach factor? Probably the best 
qualified person for such a task would be representative per- 
sons of the population at large. A group of economists, sof 
ciologists, political scientists, and city managers might be 
used, provided these professionals may be assumed to be 
familiar with the public taste, as perhaps contrasted with 
their own tastes, or their notions of what the public ought 
to want. Suppose that a committee consisting of such per- 
sons selects Health, Education, Public Administration, Rec- 
reation, and Industry, a$ the significant factors to be in- 
cluded. Another type of judge is obviously desirable to 
select and evaluate the important factors determining the effi- 
ciency of each of these departments. That is, a scale for the 
rating of health departments or educational systems would 
call for specialists in these respective fields—health admin- 
istrators and educators. ; 
The above division of judges into two classes on the basis 
of whether they are to determine the general factors to be 
considered and the weights to be applied to each, or whether 
they are to determine the methods of rating each general 
factor so selected, is, of course, itself a relative matter. Thus, 
in the example employed above, the persons charged with 
the duty of formulating a rating system for health depart- 
ments would be regarded as special judges— public health 
experts. In formulating a scale for grading health depart- 
ments, however, these judges would in turn assign special 
aspects or factors of public health work such as tuberculosis 
control, food, milk and water inspection, to the specialists in 
these subdivisions of the public health department. That is, 
specialists in tuberculosis control would formulate standards 
and methods of rating the effectiveness of the anti-tubercu- 
losis organization. Experts in sanitation would do the same 
with reference to purity of foods, milk, and water, and the 
adequacy of the sewage system. But the relative importance 
to be assigned to tuberculosis control as compared to general 


2E. L, Thorndike, Your City, (Harcourt, Brace & Co., 1939), has selected 37 
such indices of “goodness of life for good people.” See especially Chap. 2. 
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sanitation would have to be determined by the judges of 
health departments as wholes. Thus, with reference to the 
city as a, unit, the administrators of health departments 
would be special judges. But With reference to particular 
aspects of public health work they would be general judges. 
Por the rating of units larger than the city, likewise, the gen- 
eral judges for the selection of significant factors in the city 
as a whole might become special judges, relatively speaking. 
The distinction between general and special judges is, there- 
fore, subject to extensive subdivision and is always relative. 

The selection of the persons best qualified to construct a 
sociometric scale must, then, always be governed by the spe- 
cific function they are to perform. Expertness may be de- 
fined in terms of all degrees of generality and specialization. 
The general superintendent of a large industrial plant may 
be regarded quite as properly a specialist and an expert with 
reference to his particular task, as is the most highly special- 
ized scientist in the laboratory of the plant. The efficient 
Plant will consist of a hierarchy of experts, each of which is 
best qualified to determine the profitable division of labor 
within his own sphere and, consequently, to pass on the qual- 
ifications of persons to perform incr asingly specialized tasks. 


“The přoblei of selecting the best-qualified persons for the 
construction of a sociometric scale of the type here under . 


consideration resolves itself, therefore, largely into a question 
of selecting the general judges of the unit as a whole. After 
they have been chosen, the selection of judges of subordinate 
units must be accomplished by the generally accepted meth- 
Faty determining the qualifications of persons in different 
hi è 

Various methods of determining the relative eminence of 
men in the larger fields of human endeavor have been de- 
vised. One of the best known of these attempts is that by J- 
McKeen Cattell in his American Men of Science. In brief, 
Cattell’s method was as follows: “In each of the twelve prin- 
cipal sciences the names were arranged in the order of merit 
by ten leading students of science. The average positions 
and the probable errors were then calculated so that in each 
science the order of merit was determined together with its 
validity. The names were then combined in one list by 
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interpolation, the number of workers in each science being 
taken approximately proportional to the total number of 
workers in that science.” ë 

Cattell’s system of selection, however, is adapted only to the 
better-established sciences. It is not generally applicable to 
the selection of all the types of persons who may be beii 
qualified to pass judgment oñ certain aspects of a sociometri 
scale. A method more generally applicable is suggested by 
Gould‘ Briefly, his procedure is as follows: (1) All the 
names in Who’s Who in America were classified according 
to occupations, in so far as this was possible. A compara- 
tively small number of names had to be omitted because of 
the ambiguity of the occupation stated. (2) The number of 
electors for each occupation was determined according to 
the number in the whole group. (3) Each of the electors was 
asked to rate in order of merit the twenty individuals in 
their occupational group whom he considered most eminent. 
While only ten judges were desired, twice this number were 
rated in order to provide substitutes for such of the first ten: 
as were unable or unwilling to serve. (4) The rankings so 
secured were combined by adding the rankings given by all 

_ the electors to each eminent person in the respective fields. 

The ten persons in each field ranking the highest by this coms 
bined judgment were selected as judges in théir particular 
specialty. In one of the occupational categories were classi- 
fied the social scientists, community experts, general psychol- 
ogists, physiologists, botanists, geographers, and geologists, 
as a group from which the best qualified persons might be 
drawn to serve as general judges of the sociometric scale. 
They were selected: in the same way as the experts in more 
highly specialized fields. “ 

Each profession and occupation usually has its own meth- 
ods of determining the relative competency of its members, 
and these methods will ordinarily be employed in selecting 
persons to pass on the validity of the factors and weights 
employed in a sociometric scale. There are, furthermore, 
in many departments of social organization, national associa- 

2 J. McKeen Cattell, American Men of Science, (Science Press, 1906), Preface 


to the First Edition. 4 
4K. M. Gould, A Sociometric Scale for American Cities. (M.A. Thesis, 


Columbia University, 1921.) 
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tions which devote themselves largely to the task of setting 
standards, such as the Child Welfare League of America, 
The National Child Labor Committee, the Children’s Bu- 
reau of the federal government, the National Recreation As- 
sociation, the American Public Health Association, and many 
others. The standards advocated by these organizations have 
usually been developed and tested by leading authorities. 
Whenever possible, their standards should be utilized in the 
construction of sociometric scales in preference to specially 
constructed norms. As such organizations develop they can 
test and revise further the standards necessary to develop re- 
liable sociometric scales. 

While the problem of selecting the best qualified persons 
for the construction of sociometric scales should be left as far 
as possible in the hands of organizations specializing in the 
field concerned, some general problems should be considered 
at this point, touching the construction of all scales, regard- 
less of the subject matter which they are designed to measure. 
Among the most important of these problems is the type of 
units and indices desirable. Everything which was said in 
an earlier chapter regarding appropriateness, clarity, meas- 
urability and comparability of units applies to the selection 
“of measuring units for sociometric scales. But some special 
applications of these principles should be mentioned. 

Since all the significant aspects and factors of the unit 
measured cannot as a rule be included in a scale, factors 
should be selected which touch most directly and vitally the 
problems studied. Elmer has suggested that in the construc- 
tion of a scale for most organizations three types of data are 
desirable : 5 (1) Population (demographic) data, or the rela- 
tionship between the number of individuals who cooperate 
with or belong to the organization and the total eligible pop- 
ulation of the community. (2) Institutional data, or meas- 
ures of the degree to which the members participate in the 
work of the,organization. (3) Functional data, or measures 
of the degrée to which the purpose or purposes of the organ- 
ization are being achieved. From the standpoint of the 
organization alone, the last of these measures would perhaps 
be sufficient, But from the standpoint of the community, 

5M. C. Elmer, Social Research, (Prentice-Hall, 1999). pp. 298 ff. 
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the first is obviously important. The second may frequently 
be an index of the third. The relative weight to be assigned 
to each class of data will rest on experience and the combined 
judgment of the persons best qualified to judge the group 
concerned, as — above. 

While these three categories are suggestive of the general 
types of data which are desirable, they are not literally ap- 
plicable to all social units. For example, a public health 
department could not be scored in terms of the first type of 
data, namely, the proportion of the inhabitants who belonged 
to or participated in the organization. In such cases, how- 
ever, this type of data is really supplied by information on 
per capita costs, per capita equipment, and personnel per 
unit of population. Likewise institutional data may be in- 
terpreted to include the training and experience of the di- 
recting force, and the presence of health machinery — hos- 
pitals, clinics, etc.© Functional data here, as in all cases, 
would, of course consist of the various indices of accomplish- 
ment—the extent of the activities to prevent sickness, the 
number of visits to clinics, the number of nursing visits, vac- 
cinations, etc. 


E. SUMMARY OF THE PRINCIPLES OF CONSTRUCTION OF 
SOCIOMETRIC SCALES 


From these illustrations we may summarize the guiding 
principles in the construction of a sociometric scale as fol- 
lows: 

1. We should aim to select factors and units which meas- 
ure the degree to which an institution achieves its purpose 
or purposes. In the last analysis, this must be the crucial 
test. Our ratings should reflect the extent to which mem- 
bers of an organization practice the principles which the or- 
ganization exists to propagate. For example, to use Elmer’s 
illustration,’ the degree to which the members of a farmers’ 
organization to promote crop rotation actually practice rota- 
tion of crops is the final test of the functioning of that organ- 
ization, In the same way, the proportion of people who own 
their own homes, and the amount and distribution of mort- 

6 Cf. L. A. Boettiger, “The Wisconsin Better Cities Com, Soc. Forces, 


5:436-46, 1927. 
Op. cit, p. 175- 
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gages, are more vital social indices for some purposes than 
the volume of building or per capita wealth. The tax rate 
is an important item, but it is relative to the goods and serv- 
ices furnished by the government. The qualifications, num- 
ber, and pay of health officials, and the per capita expendi- 
tures for health are important indices, but the vital measure 
of the health department is the community’s freedom from 
disease, its low death rate, if and when traceable to the func- 
tioning of the department. A large per capita park acreage 
may be a suggestive index. But not until we consider the 
distribution of that acreage, i.e., the per capita acreage for 
smaller units than the whole city and relative to density 
of population and other factors, do we have a really vital 
social index. Likewisejit may be that the average expendi- 
ture for police protection may be much lower in cities of 
2500 than the average cost of such protection in cities of 
100,000. But it does not necessarily follow that persons and 
property are safer in large cities because they spend, say, five 
times as much per capita for police protection. An index 
of per capita expenditure for police must be considered in 
relation to other indices—the amount of crime, arrests, etc. 

2. Our insistence on functional units and indices does not 
mean that other indices are not important or that they are 
to be neglected. In many cases we can make a valid inter- 
pretation of the functional data only by considering the 
more static and frequently more objective institutional data. 
The mere fact that crop rotation is general in a certain area 
does not of itself prove that the farmers’ organization is re- 
sponsible or that this organization is functioning vigorously. 
Only when we find that a large percentage of farmers belong; 
that their meetings are frequent and well attended, and that 
the programs have devoted much time to the subject of crop 
rotation can we assume with some assurance that the organ- 
ization is responsible for the result observed. The mere fact 
of a low morbidity and mortality rate does not establish the 
efficiency of a health department. But when we find a highly 
qualified and well-paid health department personnel, well 
organized and equipped, correlated with such low rates, 
other conditions being constant, the presumption of a causal 
connection between the two becomes stronger. Final assur- 
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ance as to causal connection must wait upon intensive com- 
parative analysis of the relevant facts in different cities. 
Thus the demographic and structural data of a non-func- 
tional type are very important in three respects: (a) they are 
frequently the most available and objective; (b) when ex- 
perience has shown that a high degree of correlation exists 
between the efficiency of an institution and the personnel 
qualifications, per capita costs, and other population ratios 
of a non-functional type, these data become in themselves 
reliable indices of the vital functional facts in which we are 
primarily interested; (c) demographic and structural facts 
are necessary for the proper interpretation of functional facts. 
3- Whether the data called for in the scale are of the 
demographic, institutional, or functional type, there are cer- 
tain principles governing the choice of units which apply 
equally to each type of data. For purposes of comparability, 
which is the essence of a scale, the units should be stated in 
terms of rates, ratios, percentages, or standard index num- 
bers. Such ratios as the number of children in attendance 
per teacher, the library circulation per capita, the percentage 
of families owning their homes or engaged in paying for 
them, and specific death, birth, and marriage rates, are ex- 
amples of the most useful types of units to be employed in 
a sociometric scale. Some of these units (e.g., death and 
birth rates) should usually be averaged over a period of years, 
and the average employed for comparative purposes, rather 
than selected for the more or less immediate present. The 
period of time to use must be determined by:the purpose of 
the particular scale or the particular investigation, and by 
the tendency of the data in a particular field to fluctuate. 
4. We have already pointed out that the relative weights 
which are to be assigned to different factors in an activity 
must be left to the combined opinion of persons best quali- 
fied to judge. But these judges will be guided in their as- 
signment of weights by the consideration of what aspects of 
the situation it is desired to stress. For example, if it is de- 
sired to emphasize the preventive rather than the curative as- 
pects of a health department, public sanitation will be 
weighted more heavily than, say, the number. of clinical 
visits or the hospital facilities. Likewise in the schedule em- 
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ployed in the Wisconsin Better Cities Contest, education 


was allowed 4000 points as against 1000 points for all other 
divisions because of the assumed preventive value of this 
aspect of community activity, and the desire to emphasize 
the value of preventive activities. Again, this schedule ap- 
parently aimed at emphasizing recognized standards of insti- 
tutional organization rather than the functional aspects. 
So, out of a total of 190 points for the section on Family 
Social Work, 75 points were assigned to the question whether 
a case worker was employed “who has had a course in a regu- 
lar school of social work, or in a university department of 
iology, or who has had experience on the staff of a stand- 
amd case work agency.” The actual amount and efficiency 
of the family social workycarried on by persons without these 
qualifications was scored relatively lightly or was ignored 
altogether. The assignment of weights, therefore, will be 
largely governed by our focus, our purposes. 
: In weighting, the question arises how to treat apparently 
incompatible indices. A high rate of taxation is burdensome, 
and from one point of view should operate unfavorably in 
the score of a community. Yet high taxation may be the 
means whereby exceptionally good schools, playgrounds, and 
health activities are maintained. Accordingly the numerical 
score will be largely determined by the relative weight we 
assign to these two aspects. In a case of this kind, both in- 
dices should be included as direct measures without refer- 
ence to each other.* Each will then offset the other as far as 
the facts justify. If the tax rate is above the average, the 
score of the community will be lowered. But if it is getting 
full value for the expenditure of its public funds, its unfa- 
vorable score on taxation will be offset by its favorable score 
in other respects, to the extent that these are above the 
average. 

5. In general, the same tests of validity of a scale which 
were discussed in the preceding chapter are applicable to the 
sociometric scale, (1) The simplest and crudest test is a com- 
parison of the scale score with the empirical judgment of 
persons familiar with the units scored. This does not, of 
course, proveanything conclusively regarding either the scale 

+M: C. Elmer, op. cit., pp. &, 86 
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or the empirical judgments. A wide disparity in the two 
types of rating, however, may indicate the need for further 
inquiry into the validity of the scale. (2) The correlation 
between the scores of different scales applied to the same 
unit provides one kind of test of the validity of any or all of 
the scales thus correlated. (3) The general applicability of 
the scales can be determined only by application to a large 
number of units of considerable variety. For example, if we 
desire to test the validity of a sociometric scale for cities, we 
might apply it to a considerable number of cities of different 
sizes. We might then group the cities according to size and 
compare the average scores and dispersion for each grou 
If we find considerable variation in the average and disper- 
sion of different groups, it indicates that our scale, although 
valid for some groups, is not generally applicable to as varied 
a group as that to which we have applied it. If this appears 
to be the case, we may either (a) abbreviate the original scale 
so as to retain only such indices as are found to have no cu- 
mulative variation for size of the unit measured, or (b) con- 
struct a separate scale for each population group within the 
limits of which the scale appears to be valid. Which of these 
alternatives to adopt must be governed by our specific pur- 
pose in each case. 

6. As in the case of all other tests, no matter how carefully 
prepared, the final revision and validation of the sociometric 
scale must rest on an actual trial in the field. Such a test is 
important in several ways. First, it permits the elimina- 
tion of possible overlapping of parts of the scale. If it is 
found that the score for several different factors of the units 
measured correlates consistently to a high degree in a large 
number of cases, then it is evident that scoring on any one 
of these factors alone is a satisfactory measure also of the 
other factors, to the degree that the correlation between 
them has been found to exist. Thus Chapin® found that a 
comparatively simple schedule for rating the socio-economic 
status of urban middle-class families could be constructed, 


9F. S. Chapin, “A Quantitative Scale for Rating the Home and Social 
Environment of Middle Class Families in an Urban Community,” Jour. Educ. 
Psychol., 19:99-111, 1928. See es ially the later revisions of this scale, the 
results of which are reported in F. S. Chapin, Contemporary American Insti- 
tutions, (Harpers, 1935), Chap. 19- 
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and that the results correlated to a high degree with the 
` scores obtained by several other more complicated schedules. 

Second, the actual testing of the schedule in the field 
shows us which of the data called for are not available, are 
very difficult of access, or are too unreliable to warrant their 
use. The schedule must then be revised to eliminate these 
items, and, if possible, to include other more available data 
on the same aspects. 

Finally, it is by actual use of the schedule in the field that 
the data necessary for the establishment of norms become 
available. We find in this way the average or modal score 

large number of organizations as well as the modal score 
each aspect measured. The deviation of a particular case 
from this norm indicates its departure from the average, i.e. 
from “reasonable” status or attainment. The dispersion in 
a distribution of scores of a large number of cases also en- 
ables us to determine the degree of reliability, i.e., the prob- 
able error in measurements, made with our scale. 

The best way to gain familiarity with the technique of 
construction of sociometric scales is to study the detailed 
steps involved in the construction of actual scales. Having 
considered at some length the principles governing the con- 
struction of sociometric scales, we shall now consider briefly 
some concrete examples of scales which have been developed 


to measure two different types of phenomena, namely, (1) 
health conditions and (2) socio-economic status. 


F. THE MEASUREMENT OF HYGIENIC BEHAVIOR 


The following exposition of the technique of constructing 
a scale for the measurement of hygiene in rural villages in 


Syria is admirable for its clarity, completeness, and attention 
to scientific standards : 1° 


1. Specifications 


a. Validity. A scale to be valid must measure in some demon- 
strable way what it claims to measure. This scale claims to meas- 
ure (1) hygiene (2) of families (3) in Syrian (4) villages. 

(1) To prove that the scale measures hygiene its scores must 
correlate with some index of health in a group of people, or com- 


10$, C. Dodd) Controlled i iene in Syri 
Univ. Pres. <r bees <a Experiment on Rural Hygiene in Syria, (Oxford 
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munities. High correlation coefficients in a population of defined 
range or variation are a necessary condition for such proof. But 
they are not a sufficient proof. For if two series of phenomena are 
causally related they must be correlated, but they may be corre- 
lated without any relation of cause and effect between them. In 
addition to the mutual association demonstrated by a correla- 
tion coefficient, there must be relation in time of antecedent and 
consequent, to show a cause and its effect. In interacting social 
phenomena it is difficult to separate two phenomena in a clear- 
cut test such as by making each act alone and see if the other 
follows. Fortunately, sociologists may build on the medical 
sciences, in which the causal relation of much of hygiene to 
health has been established experimentally. Therefore the agree- 
ment of medical experts that certain practices are “hygienic” ones 
and their correlation with health indices will be conside: 

necessary and sufficient proof of the validity of the scale.’ 

Next, the claim of the scale to measure family hygiene must be 

roved by showing that a score represents the practices of the 
amily as a whole and is independent of the deviations of individ- 
ual members. If the scores obtained from two different in- 
formants in each family correlate high it will be a proof of this 
claim. 

The scale must be appropriate to conditions in Syria and 
similar regions of the Near East. For example, it must deal with 
anti-malarial practices, and not anti-yellow fever practices, as the 
former sickness is an important one here, while the latter is non- 
existent. 

As conditions vary in different parts of the Near East, flexibil- 
ity of the scale in order to adapt it to local conditions is essential. 
The answers to the questions must be recorded independently of 
any weighting in scoring in order to enable a different scoring 
or weighting system to be used in another locality where condi- 
tions may be different. 

The scale must fit rural conditions. It must be possible to 
record and score all the variety of conditions, practices, and 
knowledge conducive to health which may be found in the vil- 
lages. The distribution curves must not show a “pile up” of 
cases at either end due to the scoring system having a “cellar” or 
a “roof” which kept those cases from being spread further down- 
wards or upwards. Validity in this respect will be proved if all 


11 “As some practices are of dubious or controversial value in medical opin- 
ion it is well, in case of doubt as to whether to include such items in the 
scale, to err on the side of over inclusion. It is easy to reject all such infor- 
mation in statistical treatment, but impossible to treat it if the original 
schedule did not secure it. Therefore the scale should include many items 
to make possible the later sorting of the more valid from the less valid. 
Minuteness in observing data is essential in science, although this minute- 
ness goes far beyond the needs of social workers who may want to use such 
a schedule card later.” (Ibid., p. 2.) 
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answers given by Syrian villages can be recorded and scored on 
the scale without modes at either extreme of the frequency dis- 
tribution graphs. 

b. Reliability. To be reliable any measuring instrument must 
give the same results when reapplied to the same phenomena. 
Discrepancies in results are errors of measurement. If these errors 
tend to be in one direction they are constant, or one-way errors, 
but if sometimes in one direction and sometimes in the other, 
they are variable, two-ways, errors. 

Constant errors are measured by differences between the aver- 
age score of one observation, or survey, of a group and the 
average score of a second survey. This difference when divided 
by its standard deviation becomes the significance ratio. If this 
ratio is less than 3, scientific convention agrees that the observed 

r may be attributed to chance (sampling error). If the ratio 
i ve 3, the probability of its occurrence by random fluctua- 
tions of sampling is approximately one in a thousand at most. . . 
This is so see as to mean with a high degree of prob- 
ability that the observed difference is a “significant” constant 
error. 

Variable errors may be measured by the correlation coefficients 
between the two surveys. A coefficient over .g5, will here be con- 
sidered to indicate high reliability or small variable error. 

In order to eliminate the constant and the variable types of 
error, they may be analyzed into the following categories, classi- 
fied by sources. 

(1) Sampling error. A frequent source of error is from small- 
ness of the sample. The formulas for the probable error, or for 
the standard error, of sampling measure the size of this error. 
To be reliable or “significant” all indices based on this scale 
must be larger than three times their standard error. 

(2) Seasonal error. The smaller errors due to differences be- 
tween informants, surveyors, etc., although included in the sam- 
pling error, may be analyzed out separately. It is better experi- 
mental technic to eliminate the component factors than to 
depend on the fact that sampling fluctuations tend to cancel 
each other when the sample is sufficiently enlarged. 

As the scale seeks to measure the hygiene of a family through- 
out the year, it should not give different results at different 
seasons. For-a reliable scale the results of surveys at different 
Seasons must show a negligible constant error (as measured by 
the significance ratio) and a negligible variable error (as meas- 
ured by the correlation coefficient). 

(3) Informant error. The amount of informant error consid- 
ered permissible is when the correlation between the scores de- 
rived from different informants in each family exceeds .go. It is 
possible to group the scores from men and women, or from adults 
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and minors, or from married and unmarried, separately, and thus 
determine the bias that sex, or age, or marriage, may introduce 
into the informant error. 

(4) Interviewer or surveyor error. The permissible amount of 
error due to different surveyors interviewing a family may be 
specified as a correlation coefficient of not less than .go between 
the scores from different surveyors. Where surveyors work in 
pairs this error may be split into a questioner error and a re- 
corder error. The questioner error may be determined by repeti- 
tion of the schedule on a group with a different questioner? The 
recorder error may be determined by having two or more record- 
ers listen to the same informants and record the replies simul- 
taneously but independently. 

(5) Scorer error. The permissible amount of error due to 
different scorers may be specified as zero. Differences between 
different scorers can, and should be, completely eliminated’ by 
using the proper technics, hereinafter described. 

(6) Schedule error. Ifa schedule putting the questions in dif- 
ferent form were made up, after the manner of “alternate forms” 
of an intelligence test, the score of a family might become differ- 
ent. This schedule error, in its constant or variable form, is 
measurable by either the significance ratio or the correlation co- 
efficient. 

The sampling error does not always include the seasonal, in- 
formant, interviewer, scorer, and schedule errors. If in one set 
of scores there are included variations in season, informant, etc., 
the sampling error includes these. But if the set of scores ies 
a seen a error constant, as in surveying at one season only, 
or with one interviewer, or with one scorer, or with one schedule 
only, then this error is not included in the probable error of 
sampling nor measured by it. This exposition suggests how these 
contributing errors may be eliminated and also how they may be 
measured separately or in combination. 

A reliable scale should reduce all errors to a minimum, and 
then measure the amount of that minimal residual error. Only 
when the sree of accuracy of a scale is thus known can data 
yielded by it be of scientific value. 

c. Objectivity and quantitativeness. Throughout the above 
specification of the reliability, (which has been set as a goal to be 
achieved by the constructor of this scale), there have been two 
principles implicit, but which need to be emphasized as corollaries 
of the specification of reliability. The first is that all the items 
must be cast in objective form with answers in definite printed 
categories to be checked such as “Yes” or “No,” “1,” “2,” “3,” 
e.g., “Men ( ): Cereals ( ), Milk ( ), Vegetables ( ), Meat ( ), or 
Fats (),” etc. If subjective and relative answers such as “Very 
much,” “Good,” etc., are to be inserted by the surveyor at his 
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own free will, there will be no standardization for the discovery 
of uniformities. Also differences between surveyors will increase 
the surveyor error enormously. 

The second implied specification is that the data must be 
“quantifiable” — to coin a term expressing the idea that while 
the raw data may be secured in qualitative form it must be ca- 
pable of being converted into quantitative form. Appropriate 
statistical treatment can then in scientific fashion induce pat- 
terns, interrelations, and uniformities with known limits of 
error. Exact study of the phenomena and simultaneous refine- 
ment of the instrument of observation then become possible. 

d. Administration. Next ir importance to the specifications 
of scientific validity and reliability of the scale are ease and 
ate of its administration. 

(1) Self-instructing. The schedule that is put into the sur- 
veyor’s hand should contain all necessary directions for its use 
and for ruling on unusual cases as they occur. These directions 
should accompany each item and should not involve consulting 
an index or manual of instructions or otherwise distracting the 
surveyor’s attention from his conversation with his informant. 

(2) Lay surveyors. The technic should be simple enough so 
that secondary school graduates may be trained in a week to 
carry it out. It should not require a doctor or trained nurse to 
administer. 

: (3) Acceptability. The questions should be acceptable to the 
Informant, the government, the priest and other influential per- 
sons whose opposition, if sense, would obstruct the gathering 
of data. To be acceptable to the informant questions of sex 
practices, income, ages in the zone of military conscription and 
some others must be avoided. To be acceptable to the govern- 
ment, questions of political import as to taxes, reactions to 
official steps towards sanitation or absence of them must be 
carefully phrased. To avoid other opposition, religious or other 
prejudices must not be unduly excited. Thus there should be 
no appearance of doubt as to the value to health and mortality 
of the religious pronouncements in regard to eating pork, cir- 
cumcision, etc. ‘To put this specification positively the schedule 
should win the cooperation o everyone through obviousness of 
its relation to health and the reduction of disease —a universal 
human desire. 


(4) Independence of language. The schedule must be capable 
of being fi led in from interviews in any language. It must not 
depend for its reliability on standardized phrasing of questions 
to such a degree as to become invalid when translated. 

(5) Inexpensive equipment. Aside from the inevitable cost of 
transporting and maintaining a field personnel, the survey should 
keep costs of such equipment as is consumed, or used, down to a 
few piasters or a few cents per schedule filled in. 
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(6) Brevity of interviews. In order to increase the number of 
schedules filled in per daysand so reduce the cost of the field 
personnel, the interviews should not require more than an hour 
apiece. 

(7) Convenience for statistical study. The scale should be so 
printed and arranged as to facilitate recording, scoring, tabulat- 
ing, correlating, etc., with a minimum of errors of eye and 
maximum of clerical speed. . . 


2. Technics of construction 


a. Acquisition of background of rural culture. To construct a 
scale with the above specifications, the author went to live for 
several periods of a week at a time in the villages of northern 
Syria. He also visited villages in Egypt, Palestine, Iraq, and 
Persia. He studied the Bedouins:n the desert, and the Kurds in 
their tent encampments in the mountains. He visited families 
and drew them out in great detail on topics of health and sick- 
ness. His command of Turkish enabled him to have direct con- 
versation in some Turkish speaking villages, while in the Arabic 
speaking villages he carried on most of the conversations through 
an interpreter. He accompanied the doctor and the nurse of an 
itinerant clinic on their visits, and in their systematic medical 
and physical examinations of every soul in two villages. He 
lectured, sometimes with the lantern, before the men and before 
the mothers on hygiene, leading them on to discuss their diffi- 
culties and points of view. He slept in their homes and in their 
tents and observed the habits of the family as to washing, defe- 
cation, sleeping, eating, and other routine patterns of life. He 
sat at the feet of the village barber, midwife, sheikh who dispenses 
charms, peddlers selling drugs, and occasional rural doctors, in- 
ducing them to instruct him out of their wisdom as to the details 
of plying their trades. 

From these experiences and the reports of private and gov- 
ernment clinics a list of the chief diseases was drawn up. Next, 
lists were prepared of the items, or patterns, of knowledge, prac- 
tice, or environmental conditions, which seemed to be related to 
each disease or to health in general. In preparing this list it was 
necessary to avoid rejecting all folk practices as superstitious on 
the one hand, and to avoid accepting all western medical advice 
as sound in the given situation on the other hand. Folk rem- 
edies are occasionally 13 shrewd cullings from the experience of 
generations, as in the use of pumpkin seeds for worms. 


114 Striking historic examples of a kernel of truth in the chaff of folk super- 
stitions are the discovery that fever is a remedy for incipient paretic paralysis 
from the practice by an African tribe of exposing such cases to nights in a 
malarial swamp ; and the discovery of digitalis as a heart drug from the folk 
seputation of a certain witch's brew of herbs. 
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Conventional oo advice of the west may backfire in a 
given situation. Thus taking baths in the village may be a posi- 
tive menace to health. The villagers’ idea of a bath is a steam 
heating process, the “Turkish bath,” which requires facilities for 
slow cooling off. The mud hut has no such facilities and leaves 
the perspiring bather helpless in drafts. Experience seems to 
have given them the conviction of the woman who replied to the 

uestion “Why don’t you bathe the baby?” by ee “Why 
should I kill it with pneumonia?” Hygienic slogans such as 
“Bathe often” need to be either discarded, or followed up with 
a new and detailed technic of bathing which adapts the rule to 
the local culture. 

b. Trial of preliminary form of the schedule. From the lists 
thus compiled a trial list of questions was hectographed on a 
schedule card. After each question there were printed all the 
probable replies. Spaces were provided after every reply for 
checking those which might be encountered in a given family. 
These trial questions were taken to two villages and tried out. 
The doctor and the public health nurse in charge of a village 
clinic went over them carefully. A class in Methods of Social 
Research, riggs dean of college seniors and graduates, was first 
instructed in the use of the schedule, then practiced it on some 
cases, and then turned in written criticisms. 

¢. Development of Form A and Manuals of Instruction. A 
revised schedule, labeled “Form A” emerged from these technics 
of criticism. It comprised two hundred and seventy questions. 
Most of these questions required subsidiary questions to elicit 
full information. About two thousand answers were printed to 
be checked affirmatively with a “v” or negatively with an “o.” In 
addition, spaces were provided after questions calling for nu- 
par ee hesla The questions were grouped in twelve sections, 


Section r. Identification data and vital statistics, including 
data on morbidity. 

Section 2. Remedies for sicknesses represented by cuts, measles, 
typhoid, smallpox, colds, fevers, malaria, headaches, abdominal 
pains, diarrhea, constipation, childbirth, and venereal disease. 

Section 3. Cleanliness — questions on the frequency an 
pre of cleaning the person, the clothes, and the eating 

Section 4. Sex— questions on sex practices and hygiene. These 
were not to be filled in routinely but only where confidential re- 
lations could be established as the questions were very intimate 
and touched on local, as well as universal, tabus. 

Section 5. Food—an inventory of articles of diet within one 
year and the menu for twenty-four hours. 

Section 6. Water — questions bearing on the source, amount, 
and purity of the family water supply in summer and in winter. 
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Section 7. Infants—a schedule on the infant's physical condi- 
tion, diet, cleanliness, sleeping, and other habits. 

Section 8. Property—an inspection of the rooms of the house, 
the yard and the stables. 

: Section 9. Waste — questions on the disposal of garbage and 
eces. 

Section ro. Insects —inquiries as to the family’s knowledge 
about, and practice in dealing with, mosquitoes, flies, lice and 
bedbugs. 

Section rz. Sleep — questions on habits and facilities connected 
with sleeping. 

Section r2. Heating—an inventory of equipment and prac- 
tices for cooking and keeping warm in winter. . . 

d. Construction of the scoring system. The answers on the 
schedule were recorded in terms of numbers, yes, no, and similar 
objective categories. To convert them into a single quantitative 
scale, credit points were assigned to each answer in a rough at- 
tempt to weight its hygienic importance, or relative contribution 
towards reducing the specific morbidity rates and general mortal- 
ity rate in the local situation. 

A thousand points were considered to provide ample discrim- 
ination. The worst hygienic conditions see were considered 
to set the zero end of the 1000 point scale. The best, or model, 
hygienic conditions which might be achieved with present knowl- 
edge and present economic standards of living in the country 
was set at the other extreme. Since unhygienic and even danger- 
ous practices were given zero values, some positive credit was 
given to practices of neutral hygienic value. Thus the applica- 
tion of something generally aseptic, as urine, on a cut received 
less credit than the application of some antiseptic, but more 
credit than the application of something septic, such as scrapings 
of shoe leather. This in no way changed the relative credits 
given to hygienic, neutral, or unhygienic practices but only 
shifted the zero point of the arbitrary scale so that the scale dealt 
only with positive credits. This eliminated the more complicated 
algebra required in dealing with both positive and negative 
credits. 

A panel of judges was then selected to allocate the number of 
points out of the thousand that should be assigned to each sec- 
tion of the scale. After full discussion they voted independently 
and the average number of points was found. These averages 
were slightly modified to round them off in order to make the 
total of the averages for the sections add up to 1000. Then each 
judge independently distributed the points, allocated to a section, 
over the individual questions of that section. Again the average 
allotment to each question was determined and modified to make 
the total come to the number of points assigned to that section. 
Then, the points allotted to each question were distributed over 


R 
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the possible answers by a similar process. Provision was made for 
answers of equal hygienic merit which are alternatives (such as: 
the use of different vermicides ; or the feeding of orange, tomato, 
or grape juice to babies, depending on availability). 

Very often the points could not be assigned in a block—so 
many for a positive answer and nothing for a negative one — but 
were more suitably assigned in degrees. Thus for the seven 
poan assigned to the use of soap (Question No. 62) the scoring 

ormula was one point for every ten liters used annually. Again 

the hygienic value of certain acts depends on their frequency. 
Thus the five points for washing clothes were distributed : 5 for 
washing oftener than once a week, 4 for weekly washing, 3 for 
fortnightly, 2 for monthly, 1 for seasonally and o for never wash 
ing. 

The allotment of points depended on subjective judgment. 
No more objective method seemed possible until statistical anal- 
yses, correlating the data with the criteria, should reveal weight- 
ings which corresponded more closely with objective reality. To 

uce the variability of the judges’ weightings (a) resvi judg- 
ments were averaged and (b) the judges were chosen for expert- 
ness in some aspect of the field of hygiene... All the judges 
participated in allocating points to the twelve sections and, (with 
one or two exceptions) to the questions within the sections. ‘The 


first three judges then allocated the points within a question to 
its various answers. 


The schedule prepared as described above was next pre- 
tested in ten villages and a city sample. The scores secured 
were graphed (a) for the whole scale, (b) the twelve section 
scores, and (c) for each of the 270 questions. These graphs 
were studied (a) to detect ambiguities in the questions ; (b) 
to develop better categories in which to record the answers} 
and (c) to discover the range of answers so as “to provide for 
them adequately on the revised schedule without having a 
‘roof’ or a ‘cellar’ which arbitrarily curtailed the distribu- 
tion at one extreme or the other.” 1? The graphs of six sam- 
ple populations were further studied intensively in order to 
detect (a) defects of sampling, (b) spring-summer, (c) rural- 
urban and (d) Arab-Armenian differences, (e) the range of 
replies, and (f) unusual cases which might be suggestive as 
hypotheses. Each item was then considered separately from 
the standpoint of objectivity and its liability to any of the 


13 Ibid., p. 20. 
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types of error considered above, for the purpose of selecting 
the items to be included in the final form of the scale. Fi- 
nally, tests of the validity of the scale were carried out 
through the correlation of its scores with such indices as 
morbidity and mortality rates, and income. On the basis of 
all of the findings from this pretest, the entire scale was re- 
vised by adding some items, dropping others, and modifying 
still others. The result was a scale which conformed in a 
remarkable degree to the criteria set up at the outset. 


G. A SCALE FOR THE APPRAISAL OF HEALTH WORK 


A very different instrument from that described above but 
designed to measure somewhat the same general phenomenon 
in a radically different culture is the Appraisal Form for City 
Health Work developed by the American Public Health As- 
sociation. The story of its origin has been summarized as 
follows : 


The desirability of some concrete standard of organized health 
work against which the work of communities should be compared 
has been appreciated for some time and several attempts have 
been made to construct a form that would serve as a health yard- 
stick. More than ten years ago Dr. Charles V. Chapin, Health 
Officer of Providence, proposed a measure as the outcome of his 
study of state health departments. Later Mr. Franz Schneider of 
the Russell Sage Foundation prepared a table of relative values in 
health work. Dr. F. W. Sears of the New York State Department 
of Health developed a score card for rating city health work. 
These various standards embodied for the most part the personal 
opinions of individuals and of course there were many who held 
different opinions. To produce a document of some permanence 
there was needed something more than the judgment of an in- 
dividual, even though that individual might be eminently fitted 
to express seasoned judgment. 

In 1920 the American Public Health Association through its 
Committee on Municipal Health Department Practice, began an 
intensive study of the work of city health departments. At the 
request of that committee, Dr. Chapin submitted a statement 
of standards and relative values in April, 1923. The committee 
had given much thought to these proposed standards and after 
amending the statement to harmonize with its collective judg- 
meni, approved it for experimental use by the Field Director, 
Dr. W. S. Rankin. Through contact with many city health 
officers, the Field Director incorporated many constructive sug- 
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gestions in the rating schedule as it appeared when it passed the 
committee." X 


The American Public Health Association and the Ameri- 
can Child Health Association then got together and com- 
bined their respective plans. This statement of standards 
was then changed to meet the interests of the American So- 
cial Hygiene Association, the National Tuberculosis Asso- 
ciation, the National Organization for Public Health Nurs- 
ing, and the National Health Council. It was further 
modified in conference with three groups of city health offi- 
cers from New England, Ohio, and Michigan, respectively. 
Final decision on each item was left to the vote of eight of 
these officers, representing a total of thirty-four in all. 

The result was the first edition of the Appraisal Form. 
“After these lengthy preliminary conferences, after one full 
year of trial use by a group of nearly fifty cooperating city 
s health officers, and after the application of the Appraisal 

Form to the study of the data for 186 cities collected by the 
United States Public Health Association surveys, a second 
general conference was held in New York in September, 
1925, in which the Committee on Municipal Health Depart- 
ment Practice, the group of nine health officers representing 
cooperating officials in New England, Ohio, and Michigan, 
and the American Child Health Association, and other na- 
tional voluntary health agencies, all took part. On the basis 
of recommendations made by this conference the committee 
has revised the Appraisal Form and believes that it is now 
sufficiently satisfactory to be issued with the approval of the 
committee as a basis for the appraisal of city health work.” * 
The form thus revised was issued as the Second Edition of 
the Appraisal Form. A Third Edition, further revised, ap- 
peared in 1929. A special form for rural health work ap- 
peared in 1932. Finally the Appraisal Form for Local Health 
Work (first edition, 1938) 14 was developed in a form ap- 
plicable to both rural and urban units, The accompanying 


illustration (Fig. 6) is taken from this latest edition. 

1 A Health Survey of 86 Cities, Ameri i iation, R 
search Divisi New York, 19, bp. = =a Child Health Association, Re 
Pis ne for City th Work, Second Edition, Introductory 

“American Public Health Association, 1790 Broadway, New York City. 


a dinae 
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The aim of the appraisal form is to provide “a brief sched- 
ule which will yield a reasonably accurate picture of the 
health services of a community as evidenced by certain typi- 
cal sample activities. As originally developed, it was in- 
tended to cover primarily those services performed as organ- 
ized functions of the health department or allied health 
agencies in a community. As it has developed, each revision 
has extended its original scope to a degree. The appraisal 
is deliberately not based on money expended or personnel 
employed, as these indicate resources rather than perform- 
ance. Mortality rates have not been used directly as a base 
for rating since they are affected so frequently by racial, in- 
dustrial, and other factors that they have no absolute value 
as standards of rating and their use makes comparison of 
communities frequently misleading. The idea in rating has 
been rather to measure actual accomplishment—as indi- 
cated by the extent to which: vaccinations are performed, 
infants are under medical supervision, physical defects of 
school children are discovered and corrected, cases of tu- 
berculosis are hospitalized, vital statistics are properly re- 
corded and analyzed, laboratory tests are performed — with 
confidence that the honest performance of such activities 
ultimately leads to the conservation of human life and effi- 
ciency, which is the goal of all public health work.” 

The aspects of health work included in the Appraisal 
Form, and the relative weights assigned to each aspect are 
as follows: 


Weighting Factors 

Vital statistics 40 
Communicable disease control 160 
Syphilis and gonorrhea control go 
uberculosis control go 
Maternity hygiene j go 
Infant and preschool hygiene 170 
School hygiene 140 
Sanitation go 
Food and milk control 80 
Bonus for balanced program 50 
Total Weighted Score 1000 


15 Appraisal Form for Local Health Work, 1938, p- vii. 
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Each of these general divisions of health work are divided 
into from five to eight sub-categories such as (a) reporting 
and registration, (b) clinical service, (c) field nursing service, 
(d) institutional care, (e) laboratory service, (f) community 
health instruction, etc. After each item, there appears in the 
column “Value of item” a figure which indicates the relative 
importance of this item in this part or section of the health 
program (e.g., tuberculosis control), as determined by group 
judgment. Each section is assigned a total of 100 points. 
The final score of any community is obtained by multiply- 
ing the actual score obtained in a given section by the proper 
weighting factor for that section as given above. The weight- 
ing factor expresses the relative value of that activity (e.g., 
tuberculosis control) in the whole health program. 

The accompanying extractsfrom the Appraisal Form will 
illustrate the method of scoring. It will be noted that stand- 
ards are provided in each case. These standards do not rep- 

resent ideal or Utopian norms, but reasonable, attainable 
standards under the present status of knowledge and devel- 
opment of public health work. The standards are not imag: 
inary or theoretical, but are based on conditions actually ex- 
isting at the present time in the large number of cities from 
which data for this schedule have been drawn. It will be 
noted that the data called for are always objective and quan- 
titative. Finally, this schedule places the major emphasis on 
the functional aspects of the city health department — the 
actual work performed rather than the presence or absence 
of machinery for doing the work. For these and other rea- 
sons, the schedule is an excellent concrete illustration of the 
principles laid down earlier in this chapter regarding desir- 
able characteristics of this type of sociometric scale and the 
methods of its construction. 

The practical usefulness of an appraisal form of this kind 
has been admirably summarized by the American Child 


Health Association on the basis of the experience of a large 


number of cities: 


As a brief for annual a propriations of the Department of 
Health. In several Cities the rating schedule has been used to 
show in a comparative, objective manner certain deficiencies in 
the department of health, and the expenditures which would be 
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required to secure a full credit in place of these deficiencies. 
When presented in this manner to the appropriating body of the 
city, as a part of the health officer's alee request, the health 
department was, in one instance, the only department of the 
city government to receive an increase (amounting to 14 percent) 
for the coming year. In another instance, in the face of a strong 
movement for retrenchment throughout the city government, the 
health budget, presented in the light of the rating schedule, was 
the only departmental budget to remain uncut. 

As a basis for a Health Program. The value of the rating 
schedule as an outline of a well balanced health program has ap- 
pealed to several health officers to such an extent that their future 
programs are definitely based upon achieving the standards set in 
the schedule. . . 

As a basis for an Annual Report, It has occurred to one health 
officer to incorporate in the annual report of his health depart- 
ment the detailed rating of his city . . . and to publish the score 
attained with full discussion of many of the items. Another 
health officer has largely improved the character of his annual 
report in the light of the Appraisal Form. 

As a means of interesting a Mayor or Chamber of Commerce. 
In a city which had recently scored itself, the health officer was 
studying the rating when the mayor of the city happened to enter 
the office of the health department. Upon being shown the low 
rating of the city in diphtheria prevention the mayor inquired 
what could be done to improve the situation. When the health 
officer explained that he could secure a full score (30 points) if he 
had $175 with which to buy toxin-antitoxin, the mayor promised 
to make the sum available from special funds. Other health 
officers have taken the rating of their city to the mayor or the 
chamber of commerce, and, by comparing the local rating with 
that of other cities, have awakened a new interest in the city’s 
health record. 

As a means of arousing and cementing the interests of the De- 
partments of Health and Education and of the private agencies 
in each. A striking outgrowth of the use of the rating schedule 
in one city where the findings were m porten in a special meeting, 
was the development of a new sense of the interdependence of all 
health agencies on each other and the formation of a joint health 
committee on which all agencies were represented under the lead- 
ership of the health officer. In another instance the rating 
schedule brought to the health officer for the first time the 
realization that he should cooperate with the department of 
education in its health work if his city was to secure the fullest 
return from its health activities. School medical inspection was 
promptly started.1¢ 

16 Reprinted from A Health Survey of 86 Cities, (New York, 1925), P- $- 
By permission of the American Child Health Association, New York City. 
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SURVEY SCHEDULE 
ALES AEE AS tithe ata ll 


SANITATION ACTIVITIES 


I, GENERAL SANITATION, INCLUDING WATER SUPPLY 
AND EXCRETA DISPOSAL 


The function of the health department in this field as in communicable disease 
control is rapidly swinging from one of law enforcement to one of education. It 
is only within recent years that the personnel of a health department resporaias 
for the control of environmental sanitation has ceased to be known as “ Sanitary 
Police.” They are still enii referred to as sanitation inspectors and ‘occa- 

as sanitation officers. ile state and local laws frequently provide the 
basis for the control program, recently developed sanitary codes covering the 
fields pona the necessary scientific and technical guide posts for a comprehensive 
educa! program, 


39. Inspection Service 


The function of the sanitation” is primarily that of interpreting 
regulations and assisting individuals in complying with both the letter 
s the service must assume 


responsi! r 
fields the educational work of the department should decrease the neces», 
sity for routine ion and the public should take more and mote 
responsibility for maintenance of favorable sanitary conditions. On the 
other hand, when the health department through the establishment of 
service in a certain field attracts public attention to its efforts 
in improving sanitary conditions, the public comes to rely upon this 
Protection, and care must be exercised that this confidence in the 


department is not abused. 
Number of inspections and_reinspecti clusive of smoke 
inspections, inspections, and all a disease work) 


tions 
@ Municipal Male, Number of sites Number of inspei 


Se Le lg a 
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le 


i 


i 
ii 
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plumbing inspections those 
> ous aaan ‘the ‘possbiey of 
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H 


Ficurg 6. Sample pages from the Appraisal Form for Local Health w 
a 


American Public Health Association, N b- - a Ai 
Sanitation Activities. » New York, First Edition, 199 
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4 
APPRAISAL SCHEDULE 


SANITATION ACTIVITIES 


I. GENERAL SANITATION, INCLUDING WATER SUPPLY 
AND EXCRETA DISPOSAL 


39, Inspection Service Vilne of | Item for Value Assigned 


a. Inspections of municipal water supplies . 2 al Sa! iced ha 
Standard: 12 inspections per supply 
per year 
Inspections—12 plus . . . 
GARED ere | 
Less than 6 cero.) . 
Inspections = number of mu- 
nicipal water . supplies = 
swine inspections per supply 
Pale In cities in which the r S ir 
linkova a of the water E suoi is fixed in 


than the 
depart may be satisfied if the health department os 
samples for bacterial analysis at least monthly. 


b. Sanitary inspections . . . 3 3 
Standard: :25 inspections per * 1,000 
population 
Inspections—25 ie + « o 3 points 
20-249 ...2 
15-199... 1 
Less than 15 ee) 
bg ha oman == population 
1000 m= inspections per 
1.000 ‘population. 
c. Inspections for malaria control. . . . 2 
Standard: -'50°-inspections per 1,000 
. population 
Thspeotions— us . . . 2 points 
Wa i” 
Lessthan 30 . ++ 40 
Inspections +. -+ population 
$1000 == ma inspections per 
1,000 population 
than cdaes bot 1000 population, thie iiem should 
be omitted fi Is consideration Py ising. 
d. Inspections of industrial plants, . . . 1* 


Standard: Provision pa re or through 
the state for as working 
conditions in industrial plants 


Ficure 6. (Continued). (b) Sanitation Activities. 


286 SOCIAL RESEARCH 


= SURVEY SCHEDULE 


22. Reporting and Registration—continued 
Number of cases and deaths from tuberculosis reported : 


c. Is there regular and tment of 
reporting to the health de 

Sag a chee en $0 the aren from state or other 

sanatoria? ee 


d. Are Set investigations and case histories made of all cases 
oe vig meaty eee which consider the case in relation to the 
Dena (onoshek and close associates in an effort to locate 

rce and contacts 7 


Ficure 6, (Continued), (¢) Tuberculosis Control. 
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APPRAISAL SCHEDULE 


D. TUBERCULOSIS 


22. Reporting and Registration rene A bl sod Value Assigned 
a, New cases reported last year per annual HD Coe al es 
average number of deaths for past 3 


JORIS args as  whrah TE ter et avi ts 3 3 
Standard: 2 new cases per annual aver- 
age number of deaths g 
Cases — 2 plus . . > 3 points 
1.2-1.9 E 
1.0-1.4 ad 
Less than 1.0 basa 
Cases rted -+ average 
number of deaths mema. = 
cases per death 
b. Known living cases last year annual 
average number of deaths for past 3 
PU a wh a E ew A 3 3 
Standard: 5 cases per annual average 
number of deaths 
Cases — 5 plus . . . . 3 points 
SSAI Jh 
20-34 ..+- 1 
Less than 2.0 PRUDE. 
Known living cases me- -+ average 
number of deat a ae [ee 
known cases per death 
¢. Systematic reporting to health = 
ment of cases returned to area 
state or other sanatoria . . . . + 2 2 


= 


Epidemiological investigation of all cases 


e. Contacts of new reported cases examined 4 4 
Standard: 3 contacts examined per new 


case 
Contacts — 3 plus . . . . 4 points 
2229. | 


- 3 

20-24. s.. 2 

15-19. ...1 

Less thon 15... +. 0 


Ficure 6,- (Continued). (d) Tuberculosis Control. 
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H. THE MEASUREMENT OF’ SOCIO-ECONOMIC STATUS 


Planes and standards of living have always been a subject 
of considerable interest in the social sciences. A large num- 
ber of indices for the classification of families on this basis 
have been developed. The influence of home environment 
on personality and all manner of behavior traits has been a 
subject of protracted discussion. The need for an objective 
method of rating homes as desirable places in which to rear 
children first made itself felt in the field of social work. In 
this connection also a number of rating schemes and scales 
have been formulated. We select as illustrations several of 
the best of these instruments. 

F. S. Chapin in 1928 defined socio-economic status as “the 
position an individual or a family occupied with reference 
to the prevailing average standards of cultural possessions, 
effective income, material possessions, and participation in 
the group activities of the community.” 1! He constructed 
scales to measure each of these components. Cultural equip- 
ment was measured by assigning “arbitrary numerical weights 
toa list of articles such as books; newspapers, magazines, MU- 
sical articles, and other items. The total of these weights 
constituted the culture score. Effective income was meas- 
ured in terms of ammains.” 18 Material possessions were meas- 
ured by articles of household equipment weighted as in the 
first series. An index for participation in group activities 
was constructed on the basis of a detailed record of member- 
ship, contributions, attendance, committee service, and offices 
held in community organizations. Further analysis led to 
the conclusion that the total score given to living-room equip- 
ment might be taken as a fair index of socio-economic status, 
since it correlated highly with the combined score of the 
four indices; and therefore a scale consisting of 53 items of 
living-room equipment was constructed. The scale consisted 
of four groupings of items: Fixed Features, 14 items; Built- 
in Feagures, 5 items; Standard Furniture, 13 items; and, 


"F..S. Chapin, “A Quantitative Scale for Rating the Home and Social 

Environment of Middle Class Families in an Urban Community,” Jour: 

PEdue. Pych, gt = sp E 

_ammain (‘adult male maintenance”) is a unit developed by E 

Sydenstricker and W. L King (“The Measurement of Relative Economic 
tus of Families,” Quart. Publ. Amer. Statis, Asso., 17:842-57, 1921)- 


- 
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Furnishings and Cultural Resources, 21 items. On the basis 
of experience, group two (Built-in Features) was eliminated ; 
and later, in 1931, a group of items designed to reveal the 
conditions of the room and its contents was added.” 1° 

After considerable experimentation and use of the scale 
in various parts of the country, Chapin decided that the 
scale could be greatly simplified. The third revision, The 
Social Status Scale 1933, reproduced herewith was the re- 
sult. Of the original 53 items, 17 were selected because of 
their demonstrated capacity for differentiating between so- 
cial classes. They were assigned new weights so as to give 
approximately the same total as was given by the original 53 
items. The resulting scale showed a reliability of r=.96 on 
a sample of 50 professional men’s homes by the test-retest 
method, and a validity measured by correlation ratios of .57 
with occupation and .44 with income, for 442 homes. Nu- 
merous other tests of the validity of this scale have been ap- 
plied with good results. The following figures, based on 
representative samples, are the standardized mean scores on 
the 1933 scale for different types of homes : ** 


Mean 
Score 
Professional men’s homes (white) in Minneapolis... .. 147 
Boarding homes in Minneapolis (white) approved by 
the Minnesota State Cnildren’s Bureau.....-.....- 
Miscellaneous average homes (white) in Indianapolis.. 123 
Upper-class Negro homes in Minneapoi _ APE ek 107 
Industrial-class homes (white) in urban Alabama.... 55 


Industrial-class homes (white) in urban Massachusetts 50 
Poor relief cases (white) in Minneapolis.......+.+++++ 47 
Poor relief cases (Negro) in Minneapolis........-++- 40 


The present author also tested the validity of the Chapin 
scale by comparing the score distribution of the scale for the 
population of a small village with the common-sense judg- 


19 Wm. H. Sewell, The Construction and Standardization of a Scale for the 
Measurement of the Socio-Economic Status of Oklahoma Farm Families, 
‘Technical Bulletin No. 9, Oklahoma Agricultural and Mechanical College 
ap ae Experiment Station, Stillwater, Okla., April, 1940), p. 1 

F, $. Chapin, The Measurement of Social Status by the Use of the Social 
Status Scale 1933. (Univ. of Minn. Press, 1933- 16 pages, 25 cents.) See also, 
F. S. Chapin, Contemporary American Institutions, (Harper, 1935), Chap. 1 

21 For a full discussion of the validity of this scale, see F. S. C apin, bid., 
Chap. 19. A modification of the weights for the different items in the scale of 
1933 on the basis of a factor analysis is now under way. 
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Pant I. Marzata, Egurpmenr awp Currorat Expression or tae Livine Room or rae Home 


1. Floor, softwood (6) 
hardwood (10) 

2. Large rug (8) exe eee eer l 

3. Windows with drapes (each window 2). 

4. Fireplace with 3 or more utensils (8).....__ 


5. Artificial light, electric. (8) mee mmm 
herosene’{-—2): os 
6. Library table (8)... 


7. Armchairs (8 each)... 


9. Desk: personal-social (8). 

10. Bookcases with books (8 each)——.....___ 
11. Sewing machine (—2) 
12. Couch pillows (2 each) 
13. Alarm clock (—2)_... 
14. Periodicals (8 each)... 
15. Newspapers (8 each). 
16, Telephone (8) 

17 Radio (8) 


8. Piano bench (4) m mniam 


Part II: Conprtion oF ArticLES IN Livinc Room 
To provide some objective rating of qualitative attributes of the living room, such as “aesthetic at- 
mosphere” or “general impression,” the following additional items may be noted. The visitor should check 
the words that seem to describe the situation. Some of the weights are of minus sign, and so operate as 
penalties to reduce the total score of the home. ` 


18, Cleanliness of room and furnishings 
a. Spotted or stained (—4) 


b. Dusty (—2) 
c Spotless and dustless (+2) 


19. Orderliness of room and furnishings 


a. Articles strewn about in disorder (2) > ——___—_--— 


b, Articles in place or in usable order ae Lge) 


20. Condition of repair of articles and furnishings 
a. Broken, scratched, frayed, ripped, or torn (—4) 
t. Articles or furnishings patched up (—2) ——————_—_————— 
c. Articles or furnishings in good repair and well kept (+2) 


21. Record your general impression of good taste 
a Bizarre, clashing, inharmonious, or offensive (—4) 
b. Drab, monotonous, neutral, inoffensive (—2). 
c Attractive in a positive way, harmonious, quiet and restful (+2) 
Total score,” Parts I and Immo mmm 


g 


Score on Part I. 


© With penalties deducted. (See paragraph 2 on the other side) + 


Ficure 7. (Continued). 
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ments of a banker and a janitor both of whom had always 
lived in the village. Figure 8 indicates the degree of cor- 
respondence between the scale scores and the subjective judg- 
ments.?” 

Chapin’s Social Status Scale 1933, is outstanding as a brief, 
simple, reliable, and valid instrument for measuring socio- 
economic status of urban families.** Its brevity and empha- 
sis on material possessions increase$, of course, the possibility 
of erroneous or less discriminating scores in unusual, indi- 
vidual families. The remarkable fact is that, in spite of the 
above limitations of the scale and the further fact that it was 
originally constructed from comprehensive data from a sam- 
ple of only 38 urban middle-class families in Minneapolis, 
it has shown remarkable reliability and validity in several 
thousand cases from a dozen widely scattered states. These 
results suggest (a) that there may be rather simple and stable 
elements in social situations of great apparent complexity 
(e.g, socio-economic status); and (b) that the discovery of 


hes Fs Lundberg, “The Mewsurement of Socio-economic Status,” Amar, 
+ Rev., 5:29-39, 1940. For purposes of this test the Chapin scores wer 
translated into a six point scale according to the following equivalents : 


1. Upper part — 250 and up 
Upper Class 2. tee co — 200-249 
Class { 3+ Upper part — 150-199 
oe 4 pan part — 100-149 
Class { 5 Upper part — 50-99 
6. Lower part — 0-49 


h ements in individual cases were also analyzed. (Ibid.) 
Lng correlation ents (uncorrected eta) were as follows: Chapin score 
i Ae ai rating, ņ =.51 +.04; Chapin score and banker's rating, n= 


The reasons for di 


TABLE V 


ency Distribution of Ratings of the Socio-Economic Status of 196 Families on a Sit 
Point Scale by (a) The Chapin Scale, (b) A Local Janitor, and (c) A Local Banker 


Number of Families Assigned to each 


Gradation of the Six-Point Scale 


23 For a digest of the results of t i rious of the 
United States, see F. S$. Chapin, hohe a bans E soe 
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No. of Homes 
`n 
Os8SB28388 


Socio-Economic Rating 


Ficure 8, Comparative Socio-Economic Rating of 196 Homes by a Janitor, a 
Banker and the Chapin Scale. 


these elements and their reduction to indices is the key to 
practical social measurement. 

Another excellent scale directed at somewhat the same 
problem as the Chapin scale but including more items and 
a broader variety of aspects is the Minnesota Home Status 
Index published by Alice M. Leahy in 1936.4 The scale was 
based on 600 urban white families with children in the age 
group 5-14. The families were selected from various occu- 
pational groups, and were scored on a total of 84 items. The 
differentiating power of each item was tested by the criterion 
of internal consistency. The fifty items finally selected were 
assigned weights by the sigma method. (These techniques 
will be discussed below.) The final scale consists of the fol- 
lowing categories: Children’s Facilities, 11 items ; Economic 
Status, 13 items; Cultural Status, 11 items; Sociality, +13 
items; Occupational Status, 1 item ; and Educational Status, 
6 items. The scale showed a reliability of +0.92 (corrected 


24 The Measurement of Urban Home Environment, (Univ. of Minnesota 
Press, 1936). 
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by the Spearman-Brown formula) by the method of split 
halves. Its validity is reflected by a correlation of +0.94 
with the Sims Score Card 25 for 200 homes, and the wide vari- 
ation in scores shown by the scale when applied to widely 
separated economic groups. The Home Status Index is one 
of the most carefully constructed and statistically adequate 
of existing scales.*° 

These scales were designed for urban families. A similar 
instrument to measure the socio-economic status of farm fam- 
ilies was developed by William H. Sewell and published in 
1940.27 Data on 123 items from 800 farm families in three 
counties of Oklahoma served as a basis for the construction 
of the scale. Sewell utilized all the experience of previous 
attempts and the construction of such scales as those reviewed 
above as well as several decades of earlier experience, and his 
methods must be regarded as a model of how to construct a 
scale of this kind. The student is urged to study the full text 
of Sewell’s original account, as only a summary of his pro- 
cedure is possible here. 


I, THE PRINCIPAL STEPS IN THE CONSTRUCTION OF 
SOCIOMETRIC SCALES 


al Item validation? The first problem in the construc- 
tion of a sociometric scale is to select items that will differen- 
tiate various degrees of the phenomenon measured. As we 
pointed out earlier in the chapter, the original selection of 
items (really in the nature of hypotheses) can best be done 
by people thoroughly familiar with the field to be measured. 
When a tentative list of items has been selected, their dis- 
criminatory value for purposes of measurement must be 

2% V. M. Sims, The Measurement of Socioeconomic Status, (Public School 


Publishing Co., Bloomington, Ill., 1928). 
pa Ae other scales aiming at somewhat the same objectives should be men: 
tioned: M. J. McCormick, A Scale for Measuring Social Adequacy, Social 
eam abi > te $ No. 3, Washington, D. C., 1930; E. M. Burdick, “A 
roup Test o i ” i i 7 8. 
Ay Wak, H. Seva hick Archives of Psychol., No. 101, 1928 
28 For a review of this subject and an evaluation of the various techniques: 
see J. H. Long, P. Sandiford, et al, The Validation of Test Items, (Educa- 
tional Research Bull. No. 3, Univ. of Toronto Press, 1935). 
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tested. Three general methods of testing items for this pur- 
pose are in common use. One consists of correlating test 
items with some acceptable external criterion (judgments 
of experts, other tests, etc.) Another method consists of cor- 
relating individual items (usually by the “‘bi-serial 7” method) 
with the test as a whole or with particular internal criteria. 
A third method, which avoids the laboriousness of the cor- 
relation techniques and gives substantially the same results, 
is the “critical ratio” method. This consists of a comparison 
of the frequency of occurrence or te are of each 
item in relation to the total score in selected segments of the 
distribution. The procedure is as follows: (1) A total score 
is calculated for each person taking the test, (or for each fam- 
ily scored). (2) Papers are placed in rank order on the basis 
of total score. (3) Equal segments at the extremes of the 
distribution are chosen. (4) Percentage differences between 
these segments are then calculated for each item. (5) The 
statistical significance of the difference is determined for 
each item (critical ratio). (6) Items with largest significant 
differences are said to have great discriminating power and 
are retained for the final scale. This method assumes that 
the items are normally distributed in the general popula- 
tion, and, (as is true of all criteria of internal consistency) 
that the total score is an adequate criterion for validation. 

Some limitations of tests of internal consistency as meas- 
ures of validity of test items have been noted by Sletto* and 
Mosier : 2? (1) When a small sample is used chance errors 
may play a large part in the ranking of items in discrimina- 
tive value from group to group. (g) If the sample and the 
number of items in the preliminary scale are small, a con- 

20 A third method for selecting test items, namely, factor analysis, should 
be mentioned as of possibly great importance. Its principal practical diffi- 
culty at present is its laboriousness in cases where the number of items and 
the sam) fe are large. See L. L. Thurstone, The Theory of Multiple Factors, 
(Edwards Bros., Inc, Ann Arbor, Mich., 1933); Simplified Multiple Factor 


Method, (Univ. of Chicago Book Store, 1933); The Vectors of Mind, (Univ. 


of Chicago Press, 1935). 
3 Cf. Sewell, op. cit, p. 31. The method is described further below. 


31 R. F. Sletto, The Construction of Personality Scales by the Criterion of 
Internal Consistency, (The Sociological Press, Hanover, N. H., 1937), pp- 


81-83. 
ae, I. Mosier, “A Note on Item Analysis and the Criterion of Internal 


Consistency,” Psychometrica, 1:275-82, 1936. 
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siderable error in the discriminative value of an item may 
arise from including the item in the score on the prelimi. 
nary scale when determining its discriminative value. (3) 
Statistically significant critical ratios for all items in a scale 
do not assure that all items are measuring a single common 
factor.’ 

“However, these limitations need not invalidate the use 
of the criterion of internal consistency in test construction. 
The first weakness may be overcome by the use of a large 
sample. Sletto has shown that chance errors play a small 
part in the discriminative value of items when the sample in- 
cludes as many as 400 cases. The second limitation may be 
overcome by the use of a technique devised by Sletto to com- 
pensate for the displacement increment. Since the displace- 
ment increment bears an inverse relation to the number of 
items included in the preliminary scale, it may also be mini- 
mized by using a large number of items. The third limitation 
is probably inherent in all techniques other than that of fac- 
tor analysis. The only way known at present to make sure 
that each item in a scale is measuring a single common vari- 
able is to make use of the complicated factor analysis tech- 
nique.” 34 

By the above methods it is possible to select those items 
which discriminate most sharply between different quartiles 
(or other divisions) of the sample from which data for the 
construction of the scale is drawn. For examiple, if, as Sewell 
found, the item “separate living room" is present in only 
5-5 percent of the cases in the lowest one-fourth of the sam- 
ple (as measured by the total score), in 25 percent of the 
next lowest one-fourth, in 48 percent of the third quarter, 
and in 82.5 percent of the highest scoring quarter of the sam- 
ple, this indicates that the particular item has high and con- 
sistent discriminative value. Of course, many items showing 
less extreme differences may still be of discriminative value. 
The statistical significance (i.e., the probability that a differ- 
ence is due to other than chance fluctuation in the sample) 
of the difference between two percentages may be comput 


8 Cf, R. F. Sletto, op. cit. 
% William Sewell, op. cit, p. $1. 
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by simple methods described in any standard statistical text.* 
When the probability that a given difference between two 
percentages might be due to chance is only 1 to 5 in a 100 
(i.e., when the critical ratio is from 3 to 2), we assume that 
the observed percentage difference is highly likely to reflect 
a difference of the kind we wish our items to indicate. We 
would consequently include such items in our scale. When 
this criterion was applied to all percentage differences in the 
presence or absence of items in successive quartiles in Sewell’s 
scale, 82 items of the original 123 were found not to be dis- 
criminatory to the required degree and hence were elimi- 
nated.,** 

2. The weighting of items. When the items of a scale 
have been selected, scores or weights must be assigned to 
them. The simplest method that suggests itself is to assign 
zero to the absence of an item and 1 to its presence. In the 
case of graduated items, multiples of 1 may be assigned for 
successive levels. This assignment of values is, of course, en- 
tirely arbitrary, but in many cases yields as satisfactory 
results as more complicated and theoretically defensible 
methods. 

Leahy *7 describes a second method, the difference method, 
based on the reliability of the percentage differences of pos- 
session among successive quintiles of the population on which 
the scale is based. This is in part the same procedure as has 
been described above in the determination of the discrimi- 


85 When j: is desired to determine the significance of a large number of 
differences, as in the construction of a test, the Edgerton-Patterson tables 
should be used. H. A. Edgerton and D. G. Patterson, “Table of Standard 
Errors and Probable Errors of Percentages for Varying Numbers of Cases,” 
Jour. Applied Psych., 10:378-91, 1926. (Obtainable from the Editor of the 
above journal for 25 cents.) 

The critical ratio is computed as follows : 

(1) The standard error of the difference of two percentages 

odifference =V op: F op? 
(2) The critical ratio 
difference between the two percentages 
— standard error of the difference as in (1) 
__ difference 
= y dit. 

3 Five others were dropped for practical reasons leaving a total of 36 
items in the final scale divided as follows: I. Material possessions, 15; II, 
Cultural possessions, 13 ; IIT. Social participation, 8. k ja 

37 Alice M, Leahy, The Measurement of Urban Home Environment, (Univ. 
bf Minnesota Press, 1936), pp. 41-42. 
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natory power of different items. The weight assigned to an 
item is the average of the critical ratios secured from a deter- 
mination of the significance of the percentage differences be- 
tween successive portions (in this case quintiles) of the popu- 
lation.88 

A third method, and that finally adopted by both Leahy 
and Sewell, is the sigma method. This method weights an 
item in inverse relation to its frequency in the total popula- 
tion. Items that occur rarely are given the greatest weight, 
and absence of an item, as well as presence of it, receives a 
score value. Leahy states the assumptions of the method as 
follows: “As in the difference method, each item is assumed 
to be normally distributed. Zn addition, when, as in this 
study, the possession of an item is regarded as desirable, there 
is the assumption that possession deviates on the positive 
side of the mean of the whole distribution with 100 per cent 
as its termination point. Ou the other hand nonpossession 
of the item deviates in a negative direction from the mean 
of the whole distribution with the goth percentile as its ter- 
mination point. Furthermore, it posits that the most typical 
figure for percentage of either possession or nonpossession is 
one-half the observed percentage frequency.” 39 

Sewell gives the following illustration of the application 
of this method : 

“The calculation of the sigma score may be illustrated by 
computing the score for an item of the scale, such as item 38 
‘radio.’ This was possessed by 46.1 percent and not possessed 
by 53.9 percent of the sample families, The calculation of 
the score for possession of the item follows: 

100 — [ 6 = 76.9 

The distance of 76.9 percent, which is +0.737 sigmas, is read 
from a table of values of the normal probability integral.“ 

uD 

Zg dit. 
5 

® Alice M. Leahy, op, cits 

“ z i 
ra ae erir 


editien, Columbia Univ. Press, 1919), Table . However, 7 
standard table for areas under the normal curve mip iS ond. 
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This may be used as the score for the possession of a radio. 
For non-possession the score is computed as follows: the typi- 
cal value of the observed frequency is determined, in this 
instance 53.9-+2= 26.9; then, regarding the poth percentile 
as the termination point for this segment of the distribution, 
the sigma value of 26.9 percent is read from the table. This 
is —o.615 sigmas and may be regarded as the score for non- 
possession of the item. 

“In the case of graduated items, the cumulative percentage 
frequency distribution of this item is determined and the 
end of each successive truncated section is considered as the 
termination point from which the typical value would be 
located. For example, item 60, ‘magazines,’ has the follow- 
ing percentage distribution and sigma values : 


Number of 
Magazines 


“As is apparent from the examples, sigma scoring produces 
negative as well as positive weights. However, all scores may 
be made positive by adding a constant to each of the original 
sigma weights. To increase further the magnitude of the 
resulting scores, all may be multiplied by a constant. In the 
present case, 2 was used as the constant in both operations. 
When this procedure is applied in the above examples, the 
score for possession of a radio becomes 5 and the score for 
non-possession becomes 2. The scores for possession of vary- 
ing quantities of the graduated item, ‘magazines,’ are shown 
in the extreme right-hand column of the example above.” 4 

In conclusion it should be said that the principal advan- 
tage of the sigma method is that it expresses scores in stand- 
ard units. Both Leahy and Sewell found, however, that the 
sigma method yielded results almost identical with either the 


41 Wm. Sewell, op. cit, pp. 43°46. 
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simple scoring or the difference methods described above.‘ 
3. Standardization of a scale. When a scale shows satis- 
factory reliability and validity we say that it is standardized. 
Reliability refers to the ability of a scale to measure consist- 
ently, i.e., repeatedly and in the hands of different investi- 
gators, whatever it purports to measure. Validity is usually 
interpreted to mean simply that a scale measures that which 
it professes to measure, or which we want it to measure. We 
have dealt with both concepts, and the conventional tests of 
each, in several previous connections. The tests of reliability 
are rélatively simple, objective, and non-controversial. ‘The 
question of validity, however, involves some theoretical ques- 
tions of utmost importance to the future of the kind of meas- 
urement discussed in this book. We conclude, therefore, 
with a brief consideration of this subject. 
` What people usually mean by validity is that a scale pur- 
porting to measure something (designated by a familiar word) 
should agree with common-sense judgments of the phenom- 
enon. That is, we expect a scale of socio-economic status to 
yield low scores for the beggar and for people on relief, and 
high scores for the banker and the factory owner. There can 
be no doubt about the general usefulness of this ever-ready, 
common-sense test of validity. In the absence of more for- 
„ mal and crucial tests, common-sense estimates of the degree 
to which an instrument aids the adjustment sought in con- 
crete cases will always be a practical test of the validity of the 
instrument. But this fact should not deceive us into assum- 
ing that the validity of scientific instruments depends always 
or entirely upon their conformity with the immediate find- 
ings of common-sense. Indeed, this is a criterion very gen- 
erally disregarded in fields where well-recognized measuring 
instruments have been developed. For example, no one 
would think of regarding his own common-sense impressions 
of temperature, weight, distance, or composition of the liq- 
uid in the radiator of his car, as more reliable than the read- 
ings of the appropriate instrument in each case. What, then, 
have been the criteria upon which the validity of these n- 
struments have gained acceptance ? 


Ibid., p. 48. footnote 22. The correlation coefficients found between the 
results of t ifferent methods by both investigators were all -+0.98 or above. 
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The crucial test has been whether the readings of the in- 
struments, when applied to the solution of some adjustment 
problem other than the mere confirmation of common-sense, 
have contributed toward the solution of this larger problem, 
toward the solution of which the individual measurement in 
question was only a means. That is, the criterion of the va- 
lidity of the hydrometer is not whether it confirms our com- 
mon-sense estimate based on smell or taste as to the amount 
of alcohol in the solution in the radiator of our car. The 
criterion is whether an acceptance of the reading of the in- 
strument and acting on it keeps the radiator from freezing 
in a higher proportion of cases than by following our com- 
mon-sense impressions. The same point could be illustrated 
even more strikingly with reference to the measuring instru- 
ments of height and weight. Note that the conformity of 
the readings of the hydrometer to our common-sense impres- 
sions of the amount of antifreeze liquid in the radiator is in 
this case wisely regarded as completely irrelevant as a test of 
the validity of the instrument. 

The assumption that instruments for the measurement of 
social phenomena to be valid must, necessarily conform in 
their results to a consensus of common-sense findings is, there- 
fore, entirely unwarranted. This does not mean that when, 
as in the case of the Chapin scale which has been described, 
the results of common-sense estimates and standardized scales 
are in high agreement, this may not be of interest and value. 
We are merely emphasizing that this criterion is not a neces- 
sary or even a primary one. It is not a complete or final test 
of the validity or usefulness of measuring instruments of any 
kind. The more important criterion is the capacity of the 
instrument to achieve with high reliability any significant 
designated type of resudt. The conformity of the readings of 
standardized scales to our common-sense reactions to the 
(presumably) same phenomenon may 
of esthetic or linguistic rather than of 

From the beginning of psychological, “mental,” and social 
measurement, the argument has persisted in some quarters 
that, however clever the instruments, and consistent and prac- 
tically valuable the results of their use, there remains, never- 
theless, the theoretical obstacle that we do not know “what 


be said to be chiefly 
scientific interest. 
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they measure.” The feeling seems to be that this “what” 
must be some entity outside and beyond the conditions and 
behaviors (the stimuli) referred to by the symbols of the 
scale. This is an illusory problem springing from untenable 
philosophical preconceptions, and more especially from a 
misunderstanding of the nature of all symbolic, linguistic, 
and semantic phenomena. But the view is sufficiently com- 
mon to warrant our attention. 

When a new measuring instrument is invented in any 
field, the question always arises, “What does it measure?” 
Since a measuring scale is a symbolic device consisting of a 
serial set and system of symbols for the designation of a gra- 
dation of certain kinds of conditions, qualities, events, or 
behaviors, the question of what is referred to by the scale is, 
of course, not fundamentally different from a similar ques- 
tion with regard to any new word which may be proposed 
to designate some phenomenon. The answer in the latter 
case, on elementary levels, is to point, gesticulate, or engage 
in some other overt and non-linguistic form of behavior. 
That is, as we shall see, also the only final way to answer the 
question of what a scale measures. In the case of folk-lan- 
guage designations and in the case of well-established meas- 
uring devices, however, we learn these meanings as we learn 
language in general and never have occasion to analyze ex- 
actly what we mean by them. As a result, we frequently 
teify these designations into entities, which are then assumed 
to have a self-sufficient existence quite apart from the condi- 
tions, qualities, events, and behaviors which the word was 
originally invented to designate. The word “intelligence,” 
as generally used today, is a good illustration“? Any attempt 
to formalize, objectify, and render more explicit such a term 
is, therefore, likely to encounter the objection that the refer- 
ent (¢.g., behaviors or conditions considered in constructing 


ioe for example, M. May, “Ten Tests of Measurement,” Educ. Ret» 


te is intelligent. H be sure that 
it exists (sic) as a detectable ay of fases ka i the fame that mass 


chologist be sure that it {the intelligence tesi) eres a PY 
nothing else?” (p. 207). I have d r cabjes ay inte! igencs = 
OE7, (Macmillan, 1939), Pp. 446-57. 
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a scale) designated by the new term unfortunately is not the 
original entity. Of course, it is not. Symbols never are 
what they designate. A slightly more sophisticated objec- 
tion is that the new symbol does not refer to all referential 
aspects or implications of some well-known term which the 
new term aims to supplant. If the new term has been ade- 
quately defined, it will make explicit exactly what it does or 
does not cover. We do not thereby declare omitted aspects 
unimportant or less significant. The omitted aspects or ref- 
erents may be designated by some equally explicit additional 
symbol. 

Consider, for example, how readily the term “socio-eco- 
nomic status” passes from mouth to mouth even among soci- 
ologists and social workers on the assumption that all com- 
petent people know what it designates. When a formal and 
rigorously defined scale for measuring socio-economic status 
is invented and applied, the question arises as to “what,” 
after all, it measures; whether what it measures is really 
socio-economic status, etc. Why do these appear to be iffi- 
cult and important questions in the case of the scale, but 
more or less self-evident in the case of our everyday use of 
such a term as socio-economic status? The question of what 
is designated by the term is a highly proper one in both cases. 
The amusing thing is that only in the case of the scale can 
an adequate answer be given, whereas the reverse is com- 
monly implied to be the case. Scales had their origin in 
precisely this fact that the more perspicacious individuals real- 
ized that it is only in such form that we can answer the ques- 
tion of what we mean by given expressions of gradations in 
socio-economic status. This meaning, in terms of the condi- 
tions and behaviors taken into consideration by the scale, is 
also the only possible answer to the question of what any 
scale or any informal judgment measures.# The statement 


44 Possibly the reason for misunderstanding on this subject is failure to 
keep in mind that all the operations in the construction and calibration of 
a measuring device are part of the ieai geen fe Every time we use 
a measuring instrument, the operations involved in its calibration are im- 
plicitly present. Another possible reason for the misunderstandings regard- 
ing such a statement as “intelligence is what the tests test” may be the 
assumption that only a certain way of doing the particular acts involved in 
taking a given test (e.g, putting round pegs in round holes) constitutes the 
referent of the word “intelligence.” Actually, the word is used to designate 
a way of doing these particular acts and certain ways of doing all other acts 
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that socio-economic status is what a scale for measuring socio- 
economic status measures, has therefore the same validity as 
to say that the conditions and the behaviors which any group 
calls high socio-economic status is high socio-economic status 
for that group. To deny either statement and to argue or 
imply that, in addition to the above’ explicit and avowedly 
relativistic definition, there is an absolute definition of socio- 
economic status (what status “really is”), independent of any 
culture, conditions, or behavior, is surely a linguistic delu- 
sion. It is like arguing that we are making a mistake in call- 
ing water wet, because it is really not wet at all. Formal 
instruments of measurement, by explicitly recognizing the 
conditions, behaviors, and standards involved in all measure- 
ment, strip the mystery from many words and hence cause 
temporary resistance to the acceptance of new instruments. 
To assume that any scales whatever can be said to measure 
some more ultimate “what” than the conditions, events, and 
behaviors its symbolism refers to, is pure mysticism, not to 
say a type of superstition. 

A further reason for the above confusion seems to be that 
formal scales undoubtedly and necessarily neglect some com- 
ponents which each individual includes in that complex re 
sponse to which he attaches the words socio-economic status. 
The scale makes us aware of this fact. If we feel that our 
personal definition, including our feeling tones associated 
therewith, is the only proper or true definition of the phe- 
nomenon under consideration, then the scale is, of course, 
invalidated by any disagreement on its part, at least to the 
degree that it differs from our common-sense discrimination. 
But this agreement of a measuring device with either an in- 
pon a 


which are so highly correlated with the w. doing the particular acts of 
pot asa ae i e former can be predicted from the laslas and vice versa. 
i n an ese circumstances, it is still true that the particular intelligence test 
Hote $ the correlated behavior as well as the particular acts involved in the 
tan ence the statement that intelligence is what the tests test. “What 

re refers to all the certain ways of behaving which the test reliably dis- 
pean and names in terms of scores, Intelligence, then, is the symbol 
be! stiles employ to designate a certain aspect or evaluation of behavior, 
wach voli the particular acts in connection with which it occurs. Tests 
i telli reliably discriminate degrees of this aspect of behavior are called 
nteligence tests and what they test is called intelligence. For a fuller state- 


pearl gag position, see my Foundations of Sociology, (Macmillan, 1939) 
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dividual or a consensus of common-sense judgments is not 
the only possible or even a primary object in scale construc- 
tion. Scales may aim instead merely at the reliable discrimi- 
nation of any conditions whatever which enable us to make 
successful adjustments. Sometimes these conditions as dis- 
criminated by instruments are at wide variance with the 
findings of common-sense. In many such cases we find it ex- 
tremely wise to ignore the findings of common-sense and to 
follow those of the instrument. This is likely to be increas- 
ingly true as we have to adjust to remote and complex en- 
vironments. ta 

Our preference for the “qualitative” or prescalar use of 
such a phrase as socio-economic status is undoubtedly attrib- 
utable to the fact that no one is compelled to break it up 
into components or to consider the relative weights he gives 
to each factor. The whole procedure is a deliciously private 
and subjective reaction for which we are not consciously ac- 
countable to anyone. Consequently, we feel strongly that 
the meaning we attach to such words has some intrinsic va- 
lidity or fitness, as, for example, when children (and others) 
are impressed with the peculiar fitness of the word “cold” to 
designate a certain degree of temperature. For people reared 
in the same culture, and who receive their language-values 
from a fairly uniform source, there will be a certain uniform- 
ity in the use of such terms, usually sufficient to serve most 
primary-group purposes. When we come in contact with 
people of different culture backgrounds, and therefore with 
different word meanings, we marvel at their gross misjudg- 
ment of such matters as socio-economic status, living wage, 
decency, and so forth, and bring to bear against them unflat- 
tering emotions. 

As science advances, we find less and less interest in such 
questions, for example, as “what” electricity is. Most people 
find it sufficient to define what electricity is in terms of what 
it does. It is that which under certain circumstances kills 
people, makes trains go, flashes in the clouds, illuminates 
lamps, makes the hand of the voltmeter move to a certain 
point, etc. As social science advances, we shall doubtless also 
find this type of answer adequate for the question as to what 
socio-economic status is. We shall be content to say that it is 
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that which under certain circumstances makes people beg on 
streets, fawn before the local banker, and behave arrogantly 
to a janitor or waiter. We shall say it is that status which is 
associated with cértain kinds of houses, food, clothing, edu- 
cation, occupation; more specifically, we shall probably say 
that a person will be accorded status, i.e., people will behave 
toward him according to their estimate of the probability 
that he will achieve the maximum goals of socio-economic 
striving. That is what we have meant by the term in pre- 
quantitative days; it is likely to continue to be what we 
mean by it under a quantitative terminology, except that we 
shall state it in terms of a number of units on a scale. That 
is in either case the only possible answer to the question of 
what socio-economic status is or what a socio-economic scale 
measures. 

In short, it is only when we have a quantitative scale for 
measuring socio-economic status that we can give an explicit 
account of what we are measuring. We can enumerate or 
point to the items which enter into the construction of the 
scale and the proportional weight which each item is allowed 
in the total score. This is not possible when the general dic- 
tionary definition or folk usage is allowed to prevail and de- 
termine its meaning. We can never know whether another 
person means quite the same by the phrase as we do. When 
we try to determine this, we find ourselves constructing, for- 
mally or informally, a quantitative scale, 


J- CONCLUSION 


In this and the preceding chapter we have been concerned 
with some of the more recent and refined instruments for 
the measurement of social phenomena. The examples of- 
fered are, of course, merely illustrations of the large number 
of indices, scales, and other devices that have already been 
invented for the more objective study of social behavior. 
The examples we have used in the present chapter have 
been concerned more with non-verbal data, as compared 
with the attitude tests considered in the preceding chapter. 
These two types of instruments are not fundamentally dif- 
ferent, however, as attitudinal behavior is always an aspect 
of what we have called institutional behavior. Conversely, 
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the “material” culture which has figured heavily in the scales 
considered in the present chapter, has been used as an index 
of attitudinal behavior. Socio-economic status, for example, 
might be studied either by attitude scales of the type dis- 
cussed in the preceding chapter or by instruments like the 
Chapin and the Sewell scales. The accompanying biblio- 
graphical references ‘give some idea of the number and vari- 
ety of instruments that have already been invented for this 
type of social measurement. 

We have seen that the same needs which have given rise 
to the various measuring instruments of physical science, the 
system of grading goods, services, and achievement, are grad- 
ually forcing the development of scales for the measurement 
of social institutions. The history of these sociometric 
scales is the same as that of all measuring devices—a gradual 
transition from purely subjective and qualitative judgments 
to increasingly objective and quantitative methods. 

One of the principal obstacles to the development of such 
scales in the social sciences has been the widespread feeling 
that the services and values of many social institutions are so 
“intangible” or “spiritual” that they cannot be measured. 
We have shown, however, that all these values, if they are 
socially significant, have objective manifestations subject to 
quantitative measurement and comparison. It is true that 
for many forms of social behavior we do not as yet have 
standardized measures. This is the second great obstacle to 
the construction of valid sociometric scales. But their devel- 
opment must come through the constant revision and im- 
provement of tentative measures constructed from the com- 
bined judgment of persons best qualified to judge. We have 
discussed methods of selecting such persons, and the prin- 
ciples which should govern the selection of units, standards, 
and weights. Finally, we have considered some concrete ex- 
amples of the application of these principles, and the result- 
ing sociometric scales. } 

These attempts represent a recognition of the crucial role 
of measurement in all science, as argued in the opening 
chapters. The undeveloped state of technics and instru- 
ments for this purpose in the social sciences is unquestion- 
ably a major obstacle to efficient and economical research in 
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these fields. The matter can perhaps. be summarized most 
briefly by imagining what would be the state of knowledge, 
research, and practical efficiency in the. physical sciences if 
every researcher had to invent anew for each research proj- 
ect, instruments for the measurement of time, length, weight, 
temperature, etc. The corresponding state of affairs is vir- 
tually what today exists in the social sciences. There is no 
systematic theory which stipulates what we choose to regard 
as fundamental dimensions of social behavior and there are 
few standardized and verifiable definitions and measures of 
the various concepts and dimensions used by various re- 
searchers.“ Deliberate and coordinated work on this key 
problem might properly engage some of the attention of 
research foundations interested in fundamental research. 


a 
. 
hg 


K. SUGGESTIONS FOR FURTHER STUDY 


1. Perhaps the best summary and bibliography in exist- 
ence of measuring instruments of the type considered in this 
chapter is Wm. H. Sewell, The Construction and Standard- 
ization of a Scale for the Measurement of the Socio-Economic 
Status of Oklahoma Farm Families, Oklahoma Agricultural 
and Mechanical College, Agricultural Experiment Station, 
Technical Bull. No. 9 April, 1940 (Stillwater, Oklahoma). 
Part I of this excellent bulletin summarizes the story of some 
twenty-five attempts during the decades of 1910-40 to meas- 
ure socio-economic status. The rest of the bulletin gives in 
detail the steps in the construction of such a scale. The en- 
tire bulletin is. recommended as supplementary reading to 
the present chapter. 

2. The details in the construction of S. C. Dodd's scale for 
measuring rural hygiene in Syria have been mentioned in sev- 
eral previous connections, Part I of his book, A Controlled 
Experiment on Rural Hygiene in Syria, (Oxford Univ. Press, 

#5 This work will necessarily, to begin with at least, be heuristic in nature, 
rh veyed poset on srs samy ip deme ee ae te of one 


of the most comprehensive and systematic of such attacks is found in $. C 
Dodd, Dimensions of Society, (Macmillan, 4) z 4 
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1934), remains a classic example of the subject discussed in 
the present chapter. 

3. For a detailed discussion of the methods and problems 
of constructing a sociometric scale with special reference to 
the measurement of morale, see E. A. Rundquist and R. F. 
Sletto, Personality in the Depression, (Univ. of Minn. Press, 
1936). 

4. An excellent discussion of the tests of significance will 
be found in J. H. Smith, Tests of Significance : What They 
Mean and How to Use Them, (Univ. of Chicago Press, 
1939). 

5. Three important indices for measuring rural planes of 
living from census and other published sources of informa- 
tion are the following: 

a. C. E. Lively and R. N. Almack, A Method of Deter- 
mining Rural Social Sub-Areas with Application to Ohio, 
(Department of Rural Economics, Mimeograph Bulletin 
106, Ohio State Univ., Columbus, O., 1938). 

b. M. M. Blair, Indices of Levels of Living for the Thir- 
teen Southern States by Counties, 1930, (Oklahoma Agri- 
cultural and Mechanical College, Social Science Research 
Council, Bull. No. 2, Stillwater, Okla., 1939). 

c. C. Goodrich, B. W. Allin, and M. Hayes, Migration 
and Planes of Living, (Univ. of Pennsylvania Press, Phila- 
delphia, 1935). 

6. A very useful: book of suggestions as to items of impor- 
tance in different aspects of community life is Joanna C. Col- 
cord, Your Community, (Russell Sage Foundation, 2nd Edi- 
tion, 1939). 

7. Some ingenious suggestions for the development of 
scales for measuring the degree to which» motion-picture 
plays violate community mores may be found in C. C. Peters, 
Motion Pictures and Standards of Morality, (Macmillan, 
1933)- +p 

8. For a type of social measurement not treated in this 
book, but which is of basic importance, see G. K. Zipf, Na- 
tional Unity and Disunity, (Principia Press, Bloomington; 
Ind., 1941). See also, my Foundations of Sociology, (Mac- 


millan, 1939), Chaps. 9-13. 


CHAPTER X 


SOCIOMETRIC METHODS IN ECOLOGY AND 
INTERPERSONAL RELATIONS 


Perhaps the relations of a society, a social system, and the individuals 
who compose the society can be made clearer by an analogy. A rope can be 
analyzed into its component strands with respect both to its total length 
and to its content at any point in that eget. Its structural pattern can 
also be ascertained by observing the spatial relations of the various strands 
at a series of points along its length. This pattern will persist in spite of 
the termination of certain strands and the introduction of others and will 
bear little relation to, the individual qualities of the various strands — such 
things as their exact fength, thickness, or color, It can be described in the 
abstract and compared with the structural patterns revealed by other ropes. 
A society, in its extension through time, can be likened to a rope braided 
from the short strands of individual lives. By studying the social relations 
between individuals and observing the repetitive situations, it becomes pos- 
sible to deduce the structural pattern of the society. This pattern tends to 
persist in spite of the steady turnover in the society's content and bears little 
relation to the special qualities of the various individuals who occupy places 
in the Society at various times. It can be described in the abstract and com- 
pared with the structural patterns of other societies. — Ralph Linton. 


A. TYPES OF SYMBOLIC REPRESENTATION OF SOCIAL 
PHENOMENA 


The social sciences are concerned with the explanation of 
Social groupings and the interactions that are called group 
behavior. To carry on this work it is necessary, as we have 
Seen in previous chapters, to develop objective methods of 
depicting symbolically the phenomena in which we are in- 
terested. In order to emphasize that all of the detailed 
methods we have considered above are merely special tech- 
niques in this comprehensive enterprise, and in order to 
illustrate the relationship of all of these methods, we shall 
summarize in the present chapter some of the principal types 
of objective s lic representation. We find it convenient 
to discuss and illustrate this development in four general 
categories. : 

4 Geographers and ecologists were among the first to de- 
velop objective symbolic methods regarding certain aspects 
of h groupings. Geographic and ecological maps show 
human relationships with special reference to their distribu- 


1“A Neglected of ” iol, 45:870 
8, 1940, E. Ora Aspect of Social Organization,” Amer, Jour, Sociol., 45 
‘ tio 
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tion and function in geographic space. In its more elemen- 
tary forms, this type of symbolic representation is highly ob* 
jective and generally understood because it is almost a kind 
of picture-writing. Also, cartographic methods of representa- 
tion have hitherto been applied chiefly to those aspects of 
the community which are conspicuously incorporated in the 
so-called material culture, for the objective study of which 
our symbolic techniques have always been more highly de- 
veloped. The representation on paper of such data as area, 
direction, topography, channels of communication, land uti- 
lization, economic areas, etc., is, therefore, relatively stand- 
ardized and generally understood. As a result, we ascribe to 
those aspects of the community which can be depicted or 
represented by the simpler cartographic methods a “‘tangi- 
bility,” “concreteness,” and “reality,” which we withhold 
from aspects for which we have not yet developed objective 
symbolic techniques.* 

2. After mere mapping, the next step in the process of 
describing the community was to convert behavior itself into 
symbolic indices which can in turn be represented in geo- 
metric, isometric, isotypical, or gradient terms.* Thus birth, 
death, disease, and delinquency rates, or rates for any other 
behavior phenomena can be computed for local areas and 
the relationship of such phenomena to these regions can be 
shown by gradient lines or other geometric devices. A line 
on the same map representing the declining delinquency 
rate from the center of the city toward the periphery, de- 
notes a phenomenon just as real, observable and meaningful 
as a line representing the main street. (See Figures 10 and 12 
below.) Yet such is the degree of our unfamiliarity with 
such abstractions as rates, that we feel the aa representing 
the street has a certain “reality,” legitimacy, and relevance 
to the “true” representation of a community which it is felt 


2 For a development of this position, namely, that tangibility, concreteness, 
etc., are not characteristics inherent in data but merely words describing the 
degree to which we have ee objective symbolic techniques of dealing 
with such data, see G. A. Lundberg, “The Thoughtwa' of Contemporary 
Sociology,” Amer, Sociol. Rev., 1:703-23. 1936. Also A. F. Bentley, Behavior 
Knowledge Fact, (Principia Press, Bloomington, Ind., 1935), Chaps. 19, 21. 

See C. R. Shaw, Delinquency Areas, (Univ. of Chicago Press, 1929). Elsa 
S. Longmoor and E. F. Young, “Ecological Interrelationships of Juvenile 
Delinquency and Population Mobility,” Amer. Jour. Sociol, 1:598-610, 1936. 
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the delinquency gradient has not. Both types of lines repre- 
sent both a structural and a functional phenomenon. At 
present many of these behavior phenomena have not yet been 
metricized. Until they are reduced to metric units we may 
deal with them by topological constructs, or in terms of sym- 
bolic logic. This may be an intermediate step toward met- 
rical statement.* The correlation * of these various indices, 
when worked out for all the relevant aspects of community 
behavior, would represent a comprehensive description and 
explanation of the life of a community. 

3. The aspect of community life that this approach seems 
to neglect is the so-called “human” psychological factor of 
social interaction itself. Hence the next step is to invent 
objective devices for the representation of these processes 
which are at present dealt with largely in the language of 
literature and philosophy. 

It will be observed that the transition in the technique of 
representation from an ecological spot or rate map of delin- 
quency to the organization or functional chart represents 
merely a transition (a) from geographic to social space rep- 
resentation, and (b) from the relatively pictorial technique 
of the delinquency map to a more abstract form of symbolic 
representation of the behavior depicted by the functional 
chart. That is, the lines in an organization chart do not cor- 
respond to roads, railways, etc., in geographic space, but rep- 
resent relationships in social space,® such as the relation of 
the mayor to subordinate officials, just as a political bound- 
ary or a delinquency gradient represents a behavior phenom- 
enon relating two areas of a city in geographic space. In 
both cases the lines stand fundamentally for behavior phe- 
nomena of concern to social science. In the same way, purely 
topological representations or the characters of symbolic 


‘See J. F. Brown, . cit, 476 ff. Also K. Lewin, Principles of Topological 
ill, 1936), Part TI; and especially Dodd’s Dimensions 


“ ; used in its broadest sense. See G. A. Lund- 
berg » “Quantitative Methods in Social Psychology,” Amer. Sociol. Rev. 1: 


ecf. J. F. Brown, “On the Use of Mathematica in Psychological Theory.” 

i logica ' 

Psychometrika, 1:77-90, 1936. See also, P. A. Sorokin, Social Mobility, (Har- 

prem ‘9 deck don ace emnemnatical statement of the same point 
possible. P » Of course, supersede the geometric as soon 
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logic are merely other literary methods of representing rela- 
tions. These points are made for the benefit of those whé 
feel that any other language than the customary philosophi- 
cal terminology of discussing the subtler community rela- 
tions is “artificial,” “cold,” rigid, and lacking’in color. All 
symbolic representation of behavior is “artificial” in exactly 
the same way. The coldness, rigidity, and lack of color of 
the topological and mathematical symbolism may be the 
chief reason for its superiority as scientific language. It is, 
in fact, because philosophical vocabulary is so suffused with 
subtle and varied emotional tones and meanings that it turns 
out to be so often inferior in scientific discourse and hence 
is supplanted by geometric and other mathematical symbols. 
It is significant to note how well other sciences get along in 
explaining the most intricate phenomena without any of the 
type of language alleged to be so essential in sociology. 

Very similar to the formal, official, or latent organization 
of the whole community is a network of so-called private 
groups—families, religious bodies, clubs, fraternities, etc. 
These have a considerable degree of formal organization and 
permanence, but are usually thought of as relatively selec- 
tive and “voluntary” instead of inclusive and automatic in 
their membership.” Also they are more limited and special- 
ized in the type of interests which give rise to them. This 
level of community organization is usually regarded as less 
“tangible” than the formal political organization of the whole 
community. The feeling that these informal groupings are 
less tangible is due to the relative absence of objective and 
systematic information about them and the relatively un- 
stable and shifting nature of most of them. Attempts to 
study and chart this level of community structure are, how- 
ever, becoming increasingly objective.* This is notably true 


1 The family is probably an exception to this statement. But it is for pres- 
ent purposes classified among the so-called voluntary associations arising 
from drives of relatively widespread and permanent distribution, and hence 
sociologically compatabee to the religious, fraternal, recreational and socio- 
economic organizations that make up the bulk of the “voluntary” associa- 
tions of a community. A e 4 

8See Mirra Komarovsky, “A Comparative Study of the Voluntary Organi- 
zations of Two Suburban Communities,” Publ. Amer. Sociol. Soc., 27:83-93, 
1933. Also, by the same author, The Unemployed Man and His have A 
(Institute of Social Research, New York, 1940). Also, G. A. Lundberg, et al., 
Leisure, (Columbia Univ. Press, 1934), Chap. 5; F. S. Chapin, Contempo- 
rary American Institutions, (Harper, 1935), Chaps. 16, 18. 
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of studies of family structure, function, and behavior. 
» 4. Finally, there is, underlying these private and more or 
less “voluntary” social groupings, the tremendously intricate 
and elementary network of informal social nuclei which we 
describe in such terms as affinities, friendships, love affairs, 
cliques, gangs, etc. These are at present the least tangible 
social units because of their informal, transitory, private and 
frequently secret character. They may even be illicit or ille- 
gal in nature. These groupings are not structurally embod- 
ied in any of the objective forms usually accompanying other 
forms of social organization, such as written constitutions, 
buildings, and other “material” paraphernalia, and they are 
charged with no formal functions by the larger community. 
Yet the basic nature of these nuclei and their profound influ- 
ence in determining especially the functioning of the more 
formal structures has always been recognized. The influence 
f these alignments is what we rely upon “insiders,” “dope- 
ers,” and gossips to convey to us, since objective and official 
accounts of the behavior of the generally recognized social 
groups contain no mention of these underlying influences. 
Yet it is admitted that the generally recognized behavior of 
a community can in a given case be adequately explained 
Only if these basic alignments are understood. Behind all 
present objective explanations of community behavior, 
therefore, there is always an aura of data, at present largely 
in the form of gossip, because of its subjective form, which 
we all feel constitutes the “real” explanation of what takes 
place. In the same way, the clever leader knows that the 
success of any program must take into consideration the cul- 
tural, psychological, emotional, and other clique alignments, 
especially of “key” people, the existence of which may not 
be consciously recognized by the people concerned, and cer- 
tainly not admitted by them. Failure to take these align- 
ments into consideration is generally recognized as a tactical 
blunder, lack of tact, diplomacy, etc. The objective study 
of these elementary social nuclei® is, therefore, quite basic 
*The concept of soci i i ved i y 
te reter to ear paaa IO A tod phn aceon fi Shel 
ggregates, such as metropolitan areas and geographic regions. 


Within a region or a metropoli ‘ i ds 
al politan area, for example, “the ulation ten 
to subdivide and become multinucleated in a poe en My of Teate that are 
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to an understanding and intelligent control of the commu- 
nity. d 
_ For purposes of illustration we shall group the above clas- 
sifications into two main categories: (1) those having to do 
with depicting social phenomena in geographic space, or the 
techniques of human ecology; and (2) those having to do 
with the fundamental structure of human groups from the 
standpoint of interaction. 


B. TECHNIQUES OF HUMAN ECOLOGY 


One of the most elementary and common methods of 
representing the relationship of social phenomena in geo- 
graphic space is the ordinary political map. It shows the 
concentrations of people in cities, administrative areas, and 
the chief avenues of communication. When we are inter- 
ested in a more accurate representation of the frequency of 
recurrence of certain phenomena, the familiar spot-map or 
cross-hatched map is used. The latter type of map shows 
the degree to which a stated phenomenon characterizes a given 
area, and such maps are the commonest and perhaps the 
clearest way of depicting cultural regions.*® 

Since cross-hatched maps usually represent rates rather than 
raw data, these maps can be used only when the data neces- 
sary for the computation of rates are available. A spot-map 
of juvenile delinquency, for example, may be made simply 
by placing a dot at the address of the delinquent. But if it is 
desired to compare different areas as to their delinquency- 
producing character, we obviously cannot compare merely 
the total number of delinquents in each area, if one area has, 
for example, ten times as many juveniles as the other. The 
rate of delinquency, not the absolute numbers, is in this 
case the significant comparison. That is, we are interested 
in the relative number of delinquents in the two areas, per 
hundred, per thousand, or some other number-unit of ju- 
veniles in the population. 
economically and socially integrated in a larger regional unity.” R. D. Mc- 
Kenzie, The Metropolitan Community, (McGraw-Hill, 1933), p- 49. F. S. 


Chapin also uses the term to describe certain institutions as “nucleat p 
(family, church, etc.) as contrasted with “diffused-symbolic” institutions (art, 


law, science, etc.). Op. cit., Chap. 2. 
10 Excellent illustrations will be found in Howard W. Odum and H. E. 


Moore, American Regionalism, (Holt, 1938). 


316 SOCIAL RESEARCH 


Rates are one of the commonest of statistical devices and 
in their simpler and commonly experienced forms are quite 
generally understood. But the selection of a satisfactory base 
for the computation of@rates is frequently a very difficult 
matter. The basic formula for a simple rate is 


The number of times a specific kind of event 
Boge occurs in a given period i 


The whole number of exposures to the risk of 
% its deotirrence 


To choose a proper denominator in this formula we need to 
know with what component of a situation the phenomenon 
we are studying is most stably and significantly correlated. 
This is why “the number of married women aged 15 to 45” 
is a better base for the computation of a birth rate than “total 
population” or some largely irrelevant or illogical base such 
as “national income,” “total bushels of wheat produced” or 
‘some equally unrelated phenomenon. There are many s0- 
cial events the frequency of which can be counted with the 
same accuracy as births. But for many of these events we do 
not know what common component of the situation is so 
highly and meaningfully associated as is births with the fe- 
male population aged 15 to 45. Dependable rates, there- 
fore, can be computed only after a good deal is known about 
the relationship of the phenomenon to other components of 
the situation. 
„ The simplest form of rates is found in such expressions as 
miles per hour,” “persons per [one] square mile,” “income 
per [one] year,” etc. In such cases as “births [per year] per 
thousand population,” “income [per year] per capita,’’*the 
numerator is itself a rate although in these cases we are s0 
accustomed to counting such events as births and income 
payments on a yearly basis that the time unit is usually not 
mentioned. The computation and interpretation of rates is 
a subject of great importance which we must leave to the 
proper statistical treatises.1 We have referred to them here 
only because they are basic to the representation of social 
phenomena in their geographic relationships as depicted by 
ecological maps, which are here our chief interest. The com- 


11For a good brief treatment, see F. E. C wden, Ap 
plied General Statistics, (Prentice-Hall. 1960). Chap. ry ie 
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putation of a significant rate may involve all the techniques 
described throughout this book, plus statistical operations 
which we have not considered. Ecological maps are devices 
for the display of the relationships between areas with regard 
to the characteristic expressed by the rate. In short, ecologi- 
cal maps represent a method of correlating certain phenom- 
ena ona geographic basis. 

What has been said aboye regarding the use of rates in 
ecological maps also applie$ to the use Of indices of social 
conditions as, for example, the indices of socio-economic 
status, efficiency of health departments, sanitary conditions, 
and others discussed in the preceding chapter. An index 
is a more general designation than a rate, consisting fre- 
quently of a combination of rates or other figures represent- 
ing degrees of phenomena in which we are interested. Thus, 
the Work Projects Administration in its division of the 
United States into rural cultural regions and sub-regions 
(see Figure 9) computed the following indices for every 
county in the United States: 1? 


(1) A rural-farm plane-ofliving index combining the average 
value of the farm dwelling, the percent of farms having automo- 
biles, the percent of farm homes having electric lights, the per- 
cent having running water piped into the house, the percent 
having telephones, and the percent having radios, 1930." 

(2) A rural-farm population fertility index constructed by 
computing the ratio of children under 5 years of age to women 
20-44 years of age, 1930. 

(3) Percent of farms producing less than $1000 gross income, 
1926. 

(4) Percent of farm tenancy, 1935. 

(5) Land value per capita of the rural-farm population, 1930. 

(6) Percent of farm produce consumed on farms, 1929. 

(7) Percent of rural families residing on farms, 1930. 

‘The factors of more localized importance were : 

(1) Percent Negroes constituted of the total rural-farm popula- 


tion in the South, 1930. ; 
(2) Percent “other races” constituted of the total rural-farm 


population in the Southwest, 1930. 


12A. R, Mangus, Rural Regions of the United States, (Works Projects Ad- 


ministration, Division of Research, 1940), Pp. 79-80. Pe n 
13C. E. Lively and R. B. Almack, A Met od of Determining Rural Social 


Sub-Areas with Application to Ohio, (Mimeograph Bull. No. 106, Ohio State 
University and Ohio Agricultural Experiment Station, Columbus, Ohio, Jan- 


ary, 1938). 
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In addition, physiographic features were taken into account in 
places where these were prominent elements.14 

Some people may object to this procedure on at least two 
grounds: (1) Not all of the factors are cultural in the nar- 
Tower meaning of that term, and (2) so many-sided a phe- 
nomenon. as the culture of an area cannot be adequately 
characterized by ten indices alone. Both objections are 
fundamentally of the same character, and spring from a fail- 


MA number of additional factors were used to descri dividual 
regions. See A. R. Mangus, op. cit, Tables 2 shan raged ssf 


Ma 


FIGURE 9. Rural Cultural Regions. 


From Rural Regions in the United States. By A. R. Mangus (Works Projects Administration 1941). 
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ure to recognize that the innumerable individual traits of the 
physical as well as the social world are nevertheless highly re- 
lated to each other in enormous clusters or constellations of 
traits. To the extent that the intercorrelation of traits in 
each constellation is high, a single one of these traits may be 
used as an index of the whole cluster. It is this fact which 
makes such a device as an index possible, as we showed in 
the preceding chapter. It is by virtue of this fact, also, that 
we may frequently infer and measure cultural situations in 
terms or by means of non-cultural indicators. If the ten in- 
dices enumerated above are sufficiently highly and reliably 
correlated with most or all of the other phenomena which 
we have in mind when we speak vaguely of the “many-sided- 
ness,” the “unique quality” and “spirit” of “a culture,” then 
these mundane indices are, for scientific purposes at least, 
perfectly valid devices for designating and defining culture 
areas. When these areas are depicted on a map, their geo- 
graphic relationship to each other and to other features indi- 
cated on the map becomes apparent. The discovery of such 
relationships is the principal quest of science in all fields. 

Numerous studies have been made to determine to what 
extent such indices as those listed above are correlated with 
each other in different parts of the United States. One 
study 1 took a preliminary list of 83 factors and through an 
extensive but incomplete factor analysis found that at least 
one-fourth of the 83 factors were correlated to the extent of 
60 or above, with three variables, namely, a plane of living 
index, a population fertility index, and a farm income index. 
Among the factors associated in a high degree with these 
three indices were such socially significant variables as mi- 
gration, age distribution, infant mortality, size of farm, farm 
tenancy, mortgage debt, part-time farming, one-room school 
enrollment, State school aid per pupil, and churches with 
full-time pastors.1¢ 

Commenting on the study resulting in the identification of 
the regions shown in Fig. 9, A. R. Mangus says : 

We are told that the regional distribution of primitive Ameri- 
can culture was discovered when archaeological specimens in the 


15C. E. Lively and R. B. Almack, op. cit. 
16 A. R. Mangus, op. cit, p- 79- 
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American Museum of Natural History were grouped according 
to their similarity and spotted on a map by place of origin. 
Similarly, it was found that when cards containing county ratios 
representing broadly significant factors are brought together on 
the basis of homogeneity, they represent counties that tend to fall 
together or in the same general locality. Moreover, they often 
represent counties that are known to be located within a distinct 
culture area... Those counties with a low plane-of-living index 
and simultaneously low population fertility, delimit the Missis- 
si pi Delta with its one cash crop (cotton) and its high incidence 
of Negroes and tenants. The extreme opposite of the Delta coun- 
try is found in Utah and Idaho, where a high plane of living is 
associated with a high ratio of children to women, and where a 
distinct culture based on religion is present. On the basis of 
analyses of this kind the conclusion was reached that the selected 
factors have sufficiently wide significance to serve as determinants 
of what may properly be called sociological or cultural regions 
and subregions. 

The division of cities into “natural areas” has also been 
carried out according to the same basic method described 
above. The tabulation of census figures for small local areas 
(census tracts),!8 which has been greatly extended in the last 
decade, enormously facilitates the computation of more sen- 
sitive indices than was true when such figures were available 
only for wards or other relatively large and non-homoge- 
neous areas. The major types of areas in a city, such as 

gold coast,” apartment house areas, rooming house areas, 
slums, business sections, warehouse areas, etc., can roughly be 
located, of course, without detailed statistical data, although 
these areas frequently shade into each other and can be char- 
acterized as of one or the other type only in degree. For less 
conspicuous characteristics of local areas, however, we are 
dependent on rates and indices, for example, of mortality, 
morbidity, delinquency, crime, nativity, education, and all 
the factors entering into socio-economic status. The possi- 
bility of computing rates for such phenomena for small areas, 
sometimes including only a few city blocks, has made possible 
a number of significant researches into urban patterns. One 
of these patterns is illustrated in Figure 10. 


1 Alexander A. Goldenweiser, “Cultural Anthropology,” in History and 
Prospects of the Social Sciences, edited by Harry Elmer arnes, (New York : 
Alfred A. Knopf, Inc., 1935), 


P. 244. i 
Prd Census of 1940 will be tabulated according to census tracts for sixty 


ait. 
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In this illustration, cases were first plotted on a base map 
according to home addresses, and rates were then computed 
for each census tract (not shown in Figure 10) using the total 
male population aged 10 to 16 years in each tract as a base. 
Rates for the square-mile areas were computed on the basis 
of population estimated from census enumeration districts 
and census tracts. Rates for each zone were secured by 
averaging the rates for each square-mile area falling within 
each zone.” The consistently declining rate of juvenile de- 
linquency as we proceed outward from the “Loop,” illus- 
trates the gradient in social phenomena. A gradient repre- 
sents a gradation in intensity —rate of change—in a social 
condition, varying with its distribution over a given area. 
Such gradients can readily be illustrated by drawing radial 
lines from the central business district of a city in different 
directions and computing the rate (birth, death, delinquency, 
socio-economic status or any other characteristic) for each 
census tract touched by the radial line. To the extent, as 
in the case illustrated, that rates vary consistently in any ob- 
servable way, at least an hypothesis emerges as to possible 
laws or generalized patterns of urban development. 

If similar studies in other cities confirm the findings, We 
may generalize and predict from such patterns. The usual 
reservation of “other things being equal,” necessary in all 
scientific generalizations, applies, of course, also to general- 
izations regarding urban patterns. It is not necessary tO 
assume that the pattern illustrated above is everywhere pres- 
ent. The technique described is equally useful for detect- 
ing and explaining whatever patterns are found, or in meas- 
uring deviations from any real or theoretical pattern we may 
adopt as a type. Researches on urban communities employ- 
ing the methods described above are admirable illustrations 
of a type of scientific analysis. If, in addition to the con- 
ventional demographic and social work data which are at 
present available, similar analyses were undertaken regard- 
ing the distribution of opinions, attitudes, and the subtler 
social characteristics of individuals and local communities, 

™C. R. , i „ 1929). 
| "y na pane ar ye rape wut Cine oT diference 


ctween zones, see C. C, Peters, “Note on a Misconception of Statistical Sig 
nificance,” Amer. Jour. Sociol., 59°231-96, 1933. 
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Ficure 11. Juvenile Delinquency Map. 


illuminating generalizations on urban communities might 
well result. 

A yariation of the method of radials and zones is the iso- 
metric map. Areas of equal rates are connected by lines in 
the manner that geographers represent varying altitudes and 
meterologists indicate areas of varying atmospheric pressure. 
(See Figure 11.) This technique avoids the impression of 
abrupt changes in rates at the boundaries of areas, which is 
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suggested by the ordinary cross-hatch map. It is true that 
the simple spot map also shows correctly the gradual change 
in the prevalence of some phenomena, but gives no indica- 
tion of the population base. The isometricymap is designed 
to correct both of these shortcomings. In the isometric map 
convenient points in the range of rates from the highest to 
the lowest are located by interpolation and connected by 
lines as shown in Figure 11.20 Areas bounded by any two 
isometers then represent belts within which the phenomenon 
charted shows a constant rate of occurrence. 

There are many ecological techniques which have not been 
mentioned above. All of them depend ultimately upon the 
location of phenomena in geographic space. Any organiza- 
tion or instruments which facilitate this task will make pos- 
sible scientific advance on a very important front. 

The most important single development in this direction 
is perhaps the widespread adoption of census tracts as a basis 
for gathering and registering social statistics. The spotting 
of cases in their proper geographic location is greatly facili- 
tated by a street index or coding guide which includes all the 
addresses in the city classified alphabetically by streets, and 
showing what house numbers on each street are in a given 
census tract. Schmid* suggests that the greatest speed and 
accuracy in locating cases according to addresses can be ob- 
tained by writing all the addresses to be spotted on separate 
cards and sorting them in alphabetical order (or serially for 
numbered streets) and then arranging the cards in the order 


2% For a detailed account of methods of constructing such maps the stu: 
dent should consult E. R. Mowrer, “The Isometric Map as a Techni ue of 


Social Research,” Amer. Jour. Sociol., 44:86-96, 1938. See also, Elsa S. Long- ' 


moor and Erle F. Young, “Ecological Interrelationships of Juvenile Delin- 
Nar Ee ail and Population Mobility,” amore our: iol 41:598- 
21 For the best brief summary of these methods, see Calvin F. Schmid, 
“The Ecological Method,” (Chap. 14 in Pauline V. Young, Scientific Social Sur- 
s and Social Research, Prentice-Hall, 1939). Special attention should be 
called to Schmid’s discussion of centrographical and related methods of de- 
termining the center of mean distances of spatial distributions, (¢.g- the 
nging “center of gravity” of population distribution or other data). For 
extended treatment of this subject, see “The Centrographical Met 
and Regional Analysis,” by E. E. Sviatiovsky and W. C. Eells, Geog- Revs 
27:240-54, 1937. See also, Louis Wirth and Margaret Furez (Editors), Loca 
Community Fact Book, 1938, (Chicago Recreation Commission, 1938). E 
Op. cit., p. 391. For a good illustration of one such index, see Calvin 5» 
Schmid, Census Tract Street Index for Minneapolis, Minnesota, (Minneapolis 
Council of Social Agencies, 1936). 
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of the house numbers on each street. Large numbers of in- 
dividual addresses can thus be located in their proper census 
tract by a single reference to the index. In the meantime, 
increasing amounts and kinds of social statistics will doubt- 
less be recorded and tabulated by census tracts, thus permit- 
ting the computation of rates by these small districts for all 
data so recorded. There is no doubt that the method affords 
tremendous possibilities for significant sociological analysis 
and for rigorous verification of sociological hypotheses which 
has hitherto been lacking in much social research.** : 


C. SOCIOGRAPHIC METHODS 


It has been noted that most of the ecological methods here 
described depend upon the availability of demographic sta- 
tistics for given geographic areas. The accumulation of these 
Statistics may involve any or all of the methods described in 
this book. The techniques of discovering and exhibiting 
the relationships of these data in geographic space are 
the additional features introduced by ecological methods. 
Broadly speaking, these methods are graphic devices for cor- 
relating social phenomena with the factor of geographic 
space. The charting of configurations of phenomena in time 
and space is perhaps the most basic general task of all science. 
The organism’s orientation to its geographical-spatial en- 
vironment is undoubtedly one of the earliest which it ac- 
quires, and is one of the most elementary of its adjustment 
techniques. It is not surprising, therefore, that relational 
thinking in every field and on every subject tends to struc- 
ture itself in spatial terms. 

Thus it happens that the description of purely social rela- 
tionships in spatial terms is common in perhaps all folk lan- 
guages. That is, it is very common to speak of high and 
low status in life, one person being below another, of social 
mobility, of social distance, social boundaries, social barriers, 
etc. These terms in such contexts clearly have no reference 
at all to geographic space. Yet they are just as meaningful 
ways of communicating social relationships as are lines on a 

23 For a striking use of census tract data in this way, see S. A, Stouffer, 
“Intervening Opportunities: A Theory Relating “Mobility and Distance,” 
Amer. Sociol. Rev., 5:845-67, 1940. 
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topographic map in communicating relationships of geo- 
graphic space. Hence it is, also, that many a lecturer on the 
most abstract philosophical subjects will resort to spon- 
taneous drawings on the blackboard in order to communi- 
cate more adequately certain relationships which have no 
geographic loci whatever. Certain branches of mathematics 
consist, indeed, of entire systems of such space-representa- 
tions in which positional relationships of any kind may be 
expressed. We have considered above, methods of repre- 
senting the relationships of social phenomena in geographic 
space, after the manner suggested by plant and animal ecolo- 
gists. In turning to the methods of representing social rela- 
tionships in spatial symbols and terms, therefore, we are not 
imposing a geographic analogy upon the social world. We 
merely adopt spatial symbols to represent relationships re- 
gardless of the types of subject matter among which the rela- 
tionships exist. 

The commonest cartographic device for representing social 
relationships is perhaps the familiar organization chart. 
city government, for example, is represented by a series of 
circles or rectangles connected by lines. A rectangular box 
at the top represents the city manager or the mayor ; another 
box on the same horizontal level represents the city council; 
below each are smaller rectangles representing officials “un- 
der” each of the above; lines from each figure on the first 
level show whether the mayor or the council controls each of 
the respective “subordinates,” etc. 

More complicated social relationships constituting other 
aspects of social structure may be represented as in Figure 12. 
This diagram shows how the status of men and women in 
one society varies with age, sex, and the various personal an 
occupational characteristics represented by each of the pet 
sons here charted. Different ways of achieving equal status 
is apparent. For example, a female child (3) aged 10 must 
be an “honored-child” to approximate the status of any 
boy (11). Likewise, an adult woman (7), if she is notably 
older, and a medicine woman, may surpass in status a mere 
adult male (1g) unless he is a war leader (14). Also a medi- 
cine woman (7) approaches the status of a medicine man (15) 
only if she is notably older. If the horizontal axis of this dia- 
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Low Status High Status 


Ficure 12, Comanche social structure (From Linton 24) “Age levels are rep- 
resented by the long crosslines. The vertical lines pues through either 
triangles or circles indicate, by their length, the total age range of individ- 
uals occupying the status. The position of the triangle or circle with relation 
to the total length of this line does not indicate the mode of age distribution 
among the individuals occupying the status, although this refinement could 
easily be introduced in cases where vital statistics were available for the soci- 
ety. The horizontal lines connected with circles or triangles indicate the 
range of prestige among individuals occupying the status. Similarly, the 
horizontal organization of the entire figure is based on the relative prestige 
of the various statuses represented.” 


gram consisted of a status scale in standard units, these rela- 
tionships could, of course, be represented by crude equations 
resembling ordinary mathematical expressions. Similar fig- 
ures could be constructed for any other coordinates as well 
as for age and status. Multi-dimensional figures might 
take into consideration additional determinants.” 

The relationships depicted in charts of this sort do not 
have to be metricized, however, in order to be meaningful 
or useful. Non-metrical charts and schematic diagrams have 
always been valuable devices for depicting relationships of 


24 Ralph Linton, “A Neglected Aspect of Social Organization,” Amer. Four. 


Sociol., 45:870-86, 1940. a ; 
25 See, for an example, F. S. Chapin, Contemporary American Institutions, 


(Harper, 1935), P- 352+ 
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various sorts. As methods of measuring the relationships 
involved develop, and as we desire to genetalize more and 
more broadly, we shall doubtless be forced here,’as in other 
fields, toward more refined mathematical symbolization and 
toward increasing use of tested methods of calculating and 
stating averages, deviations, and probabilities. In the mean- 
time, the diagrammatic methods are valuable analytical and 
expository tools in their present form. 
Figure 12 depicts one kind of social relationship or struc- 
e, namely, prestige or status. Cartographic methods have 
also been used to represent attraction-repulsion patterns in 
‘groups. Moreno** asked all the members of the State Train- 
ing School for Girls at Hudson, New York, to indicate their 
choices, in order from one to five, of the members of the com- 
munity with whom each would like to occupy the same house. 
The choices were confidential, and it was definitely under- 
stood that they would become the basis for a reassignment 
to different houses. This is a closed community with a popu- 
lation of between five and six hundred. The girls live in 
sixteen cottages, each with a housemother. Tabulation of 
the choices of desired housemates, including housemothers, 
revealed an intricate set of societal nuclei of five princi- 
palytypes. (1) The most elementary and definite nucleus 
was the mutual first choice of two or more individuals among 
themselves. This might be represented by figures of couples, 
triangles, squares, circles, or figures of more intricate dimen- 
sions. (2) Then there were the patterns most simply repre- 
sented by chains of a non-mutual character, as, for example, 
when A chooses B, B chooses C, and C chooses A. Such 
chains might, of course, consist of any number of individuals. 
(3) A third type of configuration resulted from a clustering 


Washington, D. C., 1934), 23 f, 97 is 
: y » + 875. See also, D. Thomas, A. M. Loomis. 
and R. Arrington, tional Studies of Social Behavior, (Institute of 


Bureau of Publications, Teachers Col Columbia Uni : web 
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of a large number of choices around a single individual, to 
some of which this “star” may respond by mutual choice, to 
others by indifference, and to others by repulsion. (This 
information regarding indifference and repulsion was se- 
cured through separate private interviews.) (4) A fourth type 
of nucleation, of great practical significance, was the phe- 
nomenon of the powerful as contrasted with the popular 
individual, and the phenomenon of indirect influence. Thus 
an individual may be the object of a large number of choices 
by people relatively isolated from the rest of the commu- 
nity. The center of such a cluster of choices may be very 
popular with her immediate group, but may not have very . 
much influence in the community because of the limited con- 
tacts of her admirers. On the other hand, an individual 
chosen by only a few may wield a powerful indirect influ- 
ence by virtue of the extensive connections of the few popu- 
lar persons for whom she is a center of attraction, Moreno 
cites one case, herself chosen by only four individuals, all of 
whose attraction she reciprocates. But these four are in 
their turn chosen by nearly one hundred individuals. This 
seems to be the sociological position of the “power behind 
the throne” who wields the real power through the person- 
alities of popular nominal leaders. As Moreno has pointed 
out, “the distribution of power in large groups depends upon 
the intricate distribution of emotional currents. An indi- 
vidual who is in control and can steer the course of one of 
these currents can wield an immense potential influence out 
of all proportion to his immediate following.” Finally, (5) 
there is the isolated individual, chosen by nobody, although 
he may choose some other persons.” Such individuals are 
almost certain to be maladjusted and to cause maladjustment 
in any group with which they have to live. Sometimes, to be 
sure, such individuals, not chosen by anyone as housemates, 
may, however, be chosen as desirable work companions, so 
that isolation is always with reference to a criterion. The 
completely isolated individual would be one who was not 
chosen by anyone as an associate in any of the activities or 

27 The five general types of grouping summarized above are, of course, 
subject to extensive variation, elaboration, and sub-classification. See Mo- 
reno, op. cit, 114-33 + 147 ff. 
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relations of a community. Such an individual could not be 
very sensitive to the behavior standards of the community 
because he is cut off from the currents which constitute the 
pressures by which these standards become operative on the 
individual, and hence he would almost certainly be a social 
problem. 

Now the individuals attracted to another individual re- 
garded as a nucleus,” together with the individuals to which 
this nucleus is attracted, any or all of which attractions may 
or may not be mutual, may be regarded as a social atom” 
with reference to any social interest, i.e., any adjustment 
need. Thus, each individual, provided he is not socially iso- 
lated, is the nucleus of one or more social atoms according 
to the number of relationships in which he is the center of 
attraction. He may at the same time be the nucleus of one 
or more atoms and part of the “electronic system” of other 
atoms. These atomic structures may therefore overlap in a 


tremendous intricacy of interrelationship, each individual | 


being at the same time part of numerous diverse structures, 
but in various degrees of completeness or intensity.*° 

As was to be expected, Moreno found that a knowledge of 
the societal energy patterns of his community enabled him to 
exp! events which occurred, such as a series of runaways, 
and, "What is more important, it also enabled him to organ- 
ize the community so as to relieve tensions, and to dissipate 
certain types of maladjustment. He also suggests the ap- 
plicability of these methods in the social as well as in the 
physical planning of other than institutional communities, 
for example, such as those conducted by the Resettlement 
Administration.’ 

It should be recognized at the outset that in all of these 
studies of the interpersonal structure of communities, we 
are not under the delusion that we are charting all the rela- 


% The nucleus is here 1 
ai defined merely as the center of converging societa 


™ See F. S. Chapin, op. cit, ch. 16; “$leasuring the Volume of Social 
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tionships that arè operating. To secure the basic data neces- 
sary for a complete account of the interpersonal relationships 
in a community, protracted study through participant ob- 
server and probably psychoanalytic and other techniques 
would be necessary. We may assume that in every commu- 
nity there are, in addition to the avowed attractions and re- 
pulsions, a large number of illicit, illegal, and so-called “sub- 
conscious” likes and dislikes. Information about these is 
not easily secured through questionnaires or formal inter- 
views. On the other hand, if we allege the existence of such 
relationships, the scientist must be prepared to indicate ob- 
jectively the behavior (verbal and/or other) on the basis of 
which he declares the relationships (including the “subcon- 
scious”) to exist. It may be sufficient for the artist to de- 
clare that he knows by intuition, introspection, or under- 
standing, that such unconscious relationships exist. But this 
will hardly satisfy a scientist, however valuable these hunches 
may be as sources of hypotheses. 

In short, we cannot take cognizance in science of relation- 
ships and social structures which are alleged to be so intangi- 
ble that we cannot observe them by any verifiable technique. 
Such relationships may be postulated as hypotheses, but they 
should not be mistaken for scientific knowledge. We need 
not claim, on the other hand, that the relationships which 
can be observed objectively and verifiably at present are the 
only ones that will ever be subject to such observation. We 
may assume, on the contrary, that increasingly subtle social 
relationships will be observed as our techniques improve. 
In the meantime, we are justified in proceeding with the sys- 
tematic analysis of those social relationships for which the 
data are readily available through questionnaires or brief 
interviews. 

The type of results which are possible through mere inter- 
viewing is illustrated in Figure 13. The relationships indi- 
cated in this diagram were discovered simply through a casual 
question at the end of an interview concerned primarily with 
material and cultural possessions involved in the measure- 
ment of socio-economic status as described in the preceding 
chapter. The question was: “Who are your best friends in 
this community, i.e., with whom do you most like to visit 
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‘socially’ ?” The patterns revealed by the replies received 
are depicted in Figure 13. * d 

In the whole community (population about 1000) eight 
such constellations, differing widely in pattern, were found.” 
These were analyzed from a number of viewpoints. Only a 
few of the more general results can be given here. (1) The 
groupings are correlated to some extent with socio-economic 
status. Groups of high socio-economic status were relatively 
homogeneous in this respect as compared with groups of low 
status. (2) The average number of choices made does not 
vary significantly with socio-economic status, but the average 
number of choices received varies uniformly and directly 


with the socio-economic score of the receiver. (3) No clear = 


occupational bases for the constellations are evident, as every 
constellation has a great variety of occupations represented 
in it. (4) Family relationship is not prominent in these 
groups. (5) Geographic factors are not prominently corre- 
lated with the groupings in this village. (6) Membership in 
a common church is one of the factors most clearly associated 
with these friendship groupings. (7) Ex officio friends of 
everybody, such as ministers of the gospel, do not figure 
prominently in these groupings. Politicians, physicians, and 
officials were reluctant to mention special friends, but re- 
ceived many choices. 


» D. SOCIOLOGICAL INDICES 


While charts and maps of the type illustrated in this chap- 
ter may be of great value in depicting a particular concrete 
situation, if we are to use these cases for scientific generaliza- 
tions, we must develop methods of classifying, comparing, 
and summarizing them in terms of the frequency and the 
probability of their occurrence. It is in this connection that 
we become interested in the development of quantitative 
indices of the degree to which the observed relationships tend 
to appear and vary under different conditions. We saw above 
how some such indices have been developed in the case of 
ecological maps depicting geographic relationships. The de- 

32 Fora nL iento and analysis of the data, see G. A. Lundberg and 


on Steele, “Social Attraction Patterns in a Village,” Sociometry, 1:375°419 
1995. 
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Ficure 13. Friendship constellation in a village. The most important 
“friendship” constellation in the village, centering on the matriarch of the 
leading family with her principal satellites, a physician-politican (M 16) and 
a banker's wife (B 7). Each person is represented by a circle. The letter 
and the first number in the circle is the code symbol of the person. The 
second number is that person's score of socio-economic status on the Chapin 
Scale. Each arrow represents a choice made or received according to the 
direction of the arrow. Mutual choices are represented by double-headed 
arrows. (On account of charting difficulties the following lines are omitted 
from this chart; Me 7 and M 2 to Pr 14; We 1 and W 16 to HI 3.) 
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gree to which many kinds of social phenomena are associated 
with these geographic patterms is thus indicated. What is 
more, for scientific purposes, the degree to which a relation- 
ship is observed to hold in one case can be compared and 
generalized, by means of these indices, for human communi- 
ties in general, or at least for communities in fairly homo- 
geneous cultures. If in addition to the facts regarding the 
frequency and direction of choices depicted in our social 
structure charts, we could develop also indices of the inten- 
sity of these attractions (rot to mention repulsions), we 
could determine with greater accuracy how general the re- 
lationships are. The accurate determination of how these 
relationships vary with other factors also awaits the develop- 
ment of reliable indices of the relationships and the factors 
which we think are relevant in producing observed effects. 
_ We conclude, therefore, with a brief consideration of some 
such proposed indices, Some of them are frankly heuristic, 
and only further research can determine to what extent the 
phenomena to which these indices refer are of importance 
in sociological analysis. 

1. Suppose it is desired to compare different groups on the 
basis of the amount of interaction taking place between the 
members of each group, as measured by any observed be- 
havior, In Figure 1, for example, the number of choices of 
friends is taken as one indication of interaction. An index 
of interaction representing the percentage of total possible 
choices actually exercised between two or more people may 
be computed by the following formula : 


(expressed as a per- = Total choices made x 100 
centage) = 
The maximum number of choices ible among a given 
number of individuals is the square of the Aer aag of indi- 
viduals in the group, minus this number, or N? — N. Thus, 
the maximum number of choices possible between two indi- 
viduals is 2, among three individuals 6, among four indi 
viduals 12, etc. If, in the last case, the number of choices 


actually made is g, the index of interaction is 9* °° — 75% 
2. The above index measures the degree of interactivity 
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(as indicated by the criterion of choices) among the members 
of a group. It tells nothing about the forces drawing that. 
group apart or outward as balanced against the forces that 
hold it together, namely, the forces of intra-activity plus the 
forces pressing inward upon it from the outside. This lat- 
ter phenomenon (accepting again expressed choices as an in- 
dication of it) may be measured by a compound index of 
cohesion or measure of centripetal-centrifugal tendency com- 


puted as follows : 
s 


Number of choices made among members of an in-group plus 
the number of choices received by them from outside ê3 
Number of people in the in-group 
L 
Number of choices made outward by the in-group 


Number in the out-group receiving choices from the in-group 


Or, 
Per capita choices within a group plus per capita 
choices received from outside 


EE EOS GUE OE RBS AOE E 
Per capita choices received by an out-group from 


the in-group 
Or, 
Ca + Ci 
EE E O a RE, 
Co Er ICo 


O 


83 This formula assumes that incoming choices are a cohesive or binding 
force in their operation upon the group receiving them. While this is doubt- 
less true in some studies, it need not always be true. For example, if the 
in-group should consist of highly marriageable men and the choices should 
come from marriageable women, the incoming choices might readily be a 
disruptive force as far as the in-group is concerned and should therefore be 
subtracted rather than added in this numerator. I am indebted to Dr. Paul 
Lazarsfeld for this suggestion. He also points out that the formula as it 
stands gives great weight to the concentration of the in-group’s outgoing 
choices as represented in the last term of the formula (last denominator), 
and suggests that this factor could be given a more moderate and propor- 
tional weight by changing the jep sien En the formula to read 

1 
Tii 
Co Ci 
o + I 


$ 
For other proposed indices of the phenomenon here under discussion to- 
her with some results of their application to actual groups, see J. L. Mo- 
reno and Helen K. Jennings, “Statistics of Social Configurations,” Sociometry, 
1:942-74, 1938. See especially pp. 367-74- 
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In which 

Number in the in-group 

Number in out-group receiving choices from the in-group 
Number of choices coming in 

Number of choices going out 

Number of choices among members of the in-group 


Q 
IERSE 


Take, for example, an in-group of six members who make 
a total of 17 choices among themselves and receive 12 from 
the outside, a total of 29, a per capita of 4.83 choices. This 
is the numerator, a measure of cohesion within the group and 
pressing on it from the outside. This in-group makes six 
choices toward an out-group consisting of six persons, thus scat- 
tering its outgoing choices very thinly, namely, 1.0 choices 
per individual in the receiving group. The index of cohe- 
sion is accordingly 4.83 (4:83 + 1.00). If these six outgoing 
choices had been concentrated on, say, one individual instead 
of six, the denominator in our fraction would have been 6.0 
instead of 1.00, and the index of cohesion of the central 


group would be = » or 80. This would indicate (when- 


ever the index of cohesion is less than unity) that the six 
members of the inner group actually are more strongly drawn 
toward the hypothetical individual assumed than to the 
members of the in-group, and should therefore be considered 
as satellites of the hypothetical individual. This index also 
indicates whether the peripheral group is more closely drawn 
toward the center of a group, such as is represented in Figure 

13, or toward the margin. It is a sort of indicator of a group 
boundary in that it indicates at what point more forces draw 
a group toward one center rather than to another. 

è We are not concerned in the above indices with the ques- 
tion of whether expressed choices of friends as secured in a 
particular investigation is a desirable or a reliable indicator 
of interaction or cohesion. The proposed index merely 


quate, the onl, or the best indicator of group interaction or 
cohesion. That is a matter for research to determine. If 
these indices can be shown to vary significantly and consist- 
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ently with other sociologically important group differences 
of the type that are commonly designated as characterizing 
“closely knit,” “ethnocentric,” “self-centered,” “self-suffi- 
cient,” “homogeneous” in-groups, then the proposed indices 
may be used as measures of the degree to which groups have 
these characteristics. All indices are merely short and facile 
devices of this kind. In any event, there is no mystery about 
these indices. They specify exactly what they take into con- 
sideration, and how they take the items into consideration, as 
contrasted with such words as cohesion and interaction. 

3- One of the principal problems that arises in the anal- 
ysis of sociograms showing the distribution of choices, or 
other evidences of positive or negative interaction, is to de- 
termine to what extent the observed configuration differs 
from the interactions that may occur on a purely chance 
basis. For example, Moreno and Jennings * report the fol- 
lowing experiment : 

Twenty-six girls are allowed three choices each as to what 
members of the group they would prefer for dining-table 
companions. These data are secured for seven such groups. 
The average number of people receiving o, 1, 2, Sie we 
choices is computed. The question now arises how these 
results differ from what would be secured if, instead of mak- 
ing deliberate choice, each girl had drawn at random three 
names from an urn containing all the names but her own. 
In either case we would expect a certain number of people 
not to be chosen at all, a certain number of reciprocal choices, 
a certain number of mutual choices, etc. Only to the extent 
that the deliberate choices vary significantly from the con- 
figurations produced by chance do we have evidence of so- 
cial attractions and repulsions in the group. The actual re- 
sults secured in this experiment by (a) deliberate choice, (b) 
random drawing of names and (c) theoretical expectancy 
computed by formula * were as follows: 


84]. L. Moreno and Helen H. Jennings, “Statistics of Social Configura- 
tions,” Sociometry, 1:342-74, 1938. 

35 The computations shown in the table were made by Dr. Paul Lazarsfeld, 
(For full details see J. L. Moreno and Helen H. Jennings, op. cit., pp. 349- 
51.) Under the conditions stated, the probability of a certain person being 


chosen by any other person is p = 3: the probability of not being chosen is 
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TABLE VI 
Frequency of Types of Relationships in a Group 


Average Number of Each Type of Relationship 


Theoretical 
Chance Ex- 


Deliberate 
Choice pectancy 


kot 
68.6. 
: n 1.68 a 
Chain relations $ ” y not comput 
Closed structures (triangles, etc.) 3 not computea 
er structures i not compute 


It will be observed that when the girls choose their table 
companions deliberately, about four times as many people 
are not chosen at all as would be the case if chance alone 
were operating. Likewise, deliberate choice results in about 
three times as many mutual choices as chance alone would 
produce. Conversely, and as might be expected from the 
above results, the average number of unreciprocated choices 
is notably less under deliberate choice than under condi- 
tions of chance. The extent to which the observed configura- 
tions thus differ significantly from chance configurations in- 
dicates the degree to which social attractions and repulsions 
are genuinely operative. 

4. A sociation index (S) for groups and individuals;is sug- 
ES Res T AS CN N 
q=1—p=f; the probable number of persons who, by mere chance, 


would be picked by their fellows not at all, once, twice, etc, is given by 


carrying out the binomial expansion 


22, 3\% 
(G+) 
and multiplying by the number of persons. The general formula for n per- 


sons, each permitted a choices, is (Pp + q)*1 in which p = 


=i—Pp- 
adq P 


The probability that any child is mutually chosen by any other child is 
given by the general formula m = n(n —1) X pt 


2 

The probable frequency of unreciprocated choices by mere chance is given 
by the formula u=n(n—1)pq. See C. H. Forsyth, An Introduction to 
Mathematical Analysis of Statistics, (Wiley, 1924), Chap. 9. 
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gested by Zeleny** as follows: Each member of a group is 
asked to mark the names of every other member with the 
words “‘like,” “dislike,” “indifferent,” according to the atti- 
tude felt toward working in class groups with the other mem- 
bers of the class. (The method is, of course, equally usable 
with regard to any other attitudes or groups.) In addition, 
the members are asked to indicate their first five (or other 
number of) choices of co-workers and also their last five (or 
most objectionable) choices. The sociation index is the 
average of the units of intensity of all the interpersonal at- 
titudes in a group (I), plus or minus the average deviation 
of the intensities of the attitudes from I. This deviation is 
represented in the formula by (D). That is, 

S (sociation index) = Ī + D. 

Intensity of attitude is in the present illustrations assigned 
only two gradations : each acceptance (or rejection) received 
by a person is assigned one half unit of intensity (+ 0.5 or 
— o.5).37 If in addition to an acceptance (+ 0.5) a person 
receives also a “choice” (i.e., is not only acceptable but is 
indicated by someone as among his first five choices in point 
of desirability) he is credited with an additional half unit of 
attraction (+ 0.5 + 0.5 = 1.00). Likewise, if a person re- 
ceives a rejection (— 0.5) and also a “last choice,” (— 0.5) 
this gives an (I) of — 1.00. The sum of these intensities 


56L, D. Zeleny, “Measurement of Sociation,” Amer. Sociol. Rev., 6:173-88, 
1941. For an actual community study dealing with the problem here under 
consideration, see C. P. Loomis and D. M. Davidson, Jr., “Measurement of 
the Dissolution of In-Groups in the Integration of a Rural Resettlement 
Project,” Sociometry, 2:84-98, 1939. For some important considerations re- 
garding the statistics of configurations and the normal distribution curve, 
see Helen H. Jennings, “Quantitative As of Tele Relationships in a 
Community,” Sociometry, 2:93-100, 1939. For a more complete and general 
discussion of the theory of measurements of this kind, see S: C. Dodd, “A 
Tension Theory of Societal Action,” Amer. Sociol. Rev., 4:56-77, 1939. See 
also the same author’s Dimensions of Society, (Macmillan, 1941). 

37 “No attempt is made here to distinguish between the intensity of first, 
second and third choices; further research may or may not make these dis- 
tinctions possible. This means, too, that it is not known whether a second 
half unit of intensity (a choice) is the equivalent of twice the first half unit 
of intensity, the square of it, or some other multiple. For the present, a 
linear relationship is assumed ; and because of the satisfactory results so far 
obtained this relationship is assumed to be suitable for practical use at least. 

“Note also that the unit of magnitude of intensity of an attitude is so 
chosen that the sociation index, S = 1+ D, may have unity for its maximum 
value and thus vary in magnitude from (+-1.00 + 0.00) to —1.00 + 0.00.” 
(L. D. Zeleny, op. cit., p. 174) 


340 SOCIAL RESEARCH 


(2I) divided by the total number of possible attitudes (n) 
in the group gives the average of all thé units of intensity — 
4 i 


(I), ie., = = T, 


The average deviation of I is computed in the usual man- — 
ner, namely, 
zd_=(Be))_ EARI 
ONNE) average deviation 
in which N is the number of persons in the group. In sum- 
mary, + 
Sat ua =I z (I ~I) TA 
+D NCD NN UN | 
Suppose, for example, that in a group of three people, the 
attitudes expressed are as indicated in the numerator of the 


equation below. The sociation index would be computed 
as follows : 


= ZI _~+1+05+0+4(—0.5) +0+05 
ED) 368-1) rT 
= 0.25 units. 


paZi~) _ “75 + -25 +25 + -75 + .25 + .25 _ 2.50 
NNi 3X2 EUA 
= 0.42 units. 


Hence, S = I + D = + 0.25 + 0.42.33 


5. Zeleny goes on to show how the sociation index be- 


comes a Social status index for an individual by a simple 
modification : 


wZ2I,Zd_ 1 T A aA E 
OU, ae eN 
This is the same as the sociation formula, except that N— 1 
is substituted for N(N — 1) since we are considering here 
only the attitudes of other members of a group to a certain 


aote that the Average Deviation, although written after the calculated 


in the manner employed for indicating certain limits of observa 

Tor in science generally, here merely indicates the average amount of devi- 
ation of each intensity measure from the mean of all of these measures. The 

standard deviation may of course be used instead of the average deviation by 


substituting 
vz for za, 
n n 
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individual, and neglecting this individual’s attitudes toward 
other members. To illustrate, a person receives 2 half units 
of attraction from" person B and one half unit of repulsion 
from person C. Then his social status in that group as meas- 
ured in terms of the social units is 


sS=I4D<2 I + 2()~T +1-+(— 0.5) 4, 9-75 +0.75 
N-11 N-i 3-1 2 
= -+ 0.25 + 0.75 units. 

SS’s may range from + 1.00 + 0.00 to — 1.00 = 0.00.99 

6. Finally, the sociation index becomes 4 social adjustment 
index by the following modification : 

ps Wate Hts | ae So A. Z(I~ĪI) 
E E EET EAN 

Social Adjustment is here defined as the average intensity of 
mutual attractions and repulsions between a person and each 
of his associates in a group, plus or minus the average devia- 
tion of the measures of intensity of attitude around the mean. 
We are here concerned with the attitudes of the individual 
in question toward his associates as well as with the attitudes 
of the others toward him. Hence the base for the index 
becomes 2(N — 1) instead of N(N — 1), as in the index of 
sociation, and (N — 1) as in the social status index. (N — 
1) represents the number of attitudes a group may hold to- 
ward a person; since in a mutual relationship the person 
concerned may hold an equal number of attitudes toward the 
members of the group, 2(N — 1) represents this number. 

Zeleny has shown how these indices can be used in social 
experimentation by rearranging groups and observing the 
effect as measured by the indices as well as by other criteria 
tending to validate the indices. Others have used similar 
techniques in therapeutic measures for maladjusted groups.” 


39 The status of a person in a community might be the average of his 
Statuses in all the groups to which he belongs. e present illustration does 
not deal with this problem, however. See L. D. Zeleny, “Measurement 
of Social Status," Amer. Jour. Sociol., 45:576-82. 1940. On the measurement 
of morale, see also D. C. Miller, “Morale of College Trained Adults,” Amer. 
Sociol. Rev., 5:880-89, 1940. Also E. A. Rundquist and R. F. Sletto, Per- 
sonality in the Depression, (Univ. of Minnesota Press, 1936). 

40 See Leona M. Kerstetter, and J. Sargent, “Reassignment Therapy in the 
Classroom,” Sociometry, 3:293-306, 1940. Also Joan H. Criswell, A Socio- 
peste Study of Race Cleavages in the Classroom, Archives of Psych., New 

‘ork, 1939. 


342 SOCIAL RESEARCH 


It should be emphasized again that these indices do not pre- 
tend to designate or measure all of the Aspects of the phe- 
nomena denoted or connoted by such Words as sociation, 
interaction, status, or adjustment. On the other hand, the in- 
dices designate precisely what they do take into considera- 
tion, and represent the kind of tools of analytical thinking 
which every science seeks. Whether and to what extent 
these indices prove useful in sociological analysis and social 
control is a question to be settled by future research of the 
type mentioned above. 
a An index of Social turnover or metabolism.—The term 
social metabolism has been used to designate the rate of shift 
in the personnel composition of groups. It is a general term 
for what in a special casein economics is called labor “turn- 
over.” E. Y. Hartshorne, after reviewing the various in- 
dices that have been proposed in this field, suggests the fol- 
lowing stability index: 
Tea 2R 
stability index = ta tC. X 100. 
In which 
G, = the size of a group at the time subsequent to which we 
wish to measure change. 
G, = the same group, (nominally, at least) after change dur- 
eel aber ene tea some or all az u 
ejar ve been lost and new ones 


R = the residue who have survived the period of change. 


That is, if a university has a faculty of 615, at one time, loses 
304 members, leaving 311, and adds 14 new ones, thus hav- 
ing 325 at the end of the period, its stability index for the 
period is: 
2X 31 622 
615 + 325 X 100 = 540 X 100 = 66.1%. 


It ‘will be observed that 100% stability (or 0% turnover) 
would be shown by this index only if R, G,, and G, were 
the same. Likewise 0%, stability (or 100%, turnover) would 


41E. Y. Hartshorne, “Metabolism Indices and nnexa Austria : 
A Note on Method,” Amer. Jour. Sociol., a, ue ok 
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be shown only when R = zero, i.e., when all members are re- 
placed. » 

As in the'case of all indices, this index does not pretend to 
describe all significant aspects of the metabolic or turnover 
process. The operations involved in its computation show 
exactly what aspects it does take into consideration, and it 
should not be expected to measureyanything but these as- 
pects, and whatever aspects may be Gian to correlate to a 
stated degree with those measured by the index. If this prin- 
ciple were kept more generally in mind both in inferential 
reasoning from indices and in criticisms"of social indices, a 
great deal of futile controversy would be avoided. 


E. SUMMARY AND GONCLUSION 


The discovery of systematic relationships among the multi- 
tudes of events that occur in, among, and around us is, as 
we have seen, the central problem of all science. This quest 
has been immeasurably advanced through the use of written 
symbols by means of which the uniformities of recurring 
events have become demonstrable. By means of symbols, 
also, enormous areas of data are brought together in suffi- 
ciently compact form to reveal the interrelationships be- 
‘tween the parts. For example, a city map the size of this 
book page reveals at a glance geographic, ecological, and 
demographic relationships which could not be apprehended 
in a long time by a casual stroller. We considered, accord- 
ingly, in the first part of the chapter, the cartographic devices 
which have been found useful in depicting social conditions 
and relationships, against a background of geographic space. 

We found in the second place, that cartographic tech- 
niques were also useful in depicting purely social relation- 
ships without any reference to geographic space. Charts of 
social organization and functioning are almost as common as 
geographic maps. These social structures and functions are 
conceived to exist in social space, as when we speak, for ex- 
ample, of the president being above the members of his cabi- 
net, the city manager being under the city council, etc. These 
spatial designations indicate certain social relationships and 
functionings, and in this way are as useful as spatial designa- 
tions indicating which part of a city, or which army, is above 
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another in a purely geographic sense. In the same way, we 
saw that social structure, status, interpersonal relations, and 
many so-called “abstract” and “intangible” relationships may 
also be depicted objectively by geometric, topological, and 
sociometric diagrams and charts, 

Finally, we showed that as we wish to refine and sharpen 
the content of the patterns found by the above methods, and 
to generalize our observations for scientific purposes, we 
need indices denoting the extent, degree, and particular as- 
pects of the relationships, roughly indicated in the charts. 
That is, a map shdWwing the relative positions of two cities is 
of importance if we want to build a railroad between them. 
If, in addition, the map shows in standard units the distance 
in miles and the exact location and elevation of hills and val- 
leys, this adds vastly to the usefulness of the map. Likewise, 
a simple diagram of certain relationships among members 
of a particular group may be useful for some purposes. But 
if we wish to compare this group with others, and especially 
if we want to arrive at generalizations about the observed 
relationships, we need symbolic devices designating exactly 
what we are comparing, such as the kind and intensity, of the 
relationships. Indices are designed to supply this need. 

Indices —rates, coefficients, and formulas of all kinds —are 
especially designed to symbolize briefly and precisely im- 
portant relationships among phenomena. Accordingly, all 
sciences strive to develop reliable indices of the phenomena 
with which they deal. A vast variety of indices for specific 
uses have, therefore, been developed in every field, includ- 
ing the social sciences. The practical demands of business 
and education have greatly stimulated these developments 
in economics and psychology. We have in previous chapters 
considered at some length the attempts to develop similar 
indices of important social relationships. The additional 
illustrations given in the present chapter, together with those 
discussed in other chapters, do not, of course, represent a 
complete list of social and sociological indices now proposed 
and in use? We have aimed rather to illustrate recent at- 

42 The most comprehensive Proposals for indices of a very large variety of 


sociological phenomena will be found in $, C. Dodd, Dimensions of Society, 
(Macmillan, 1941). See especially Chap. 10. 
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tempts to develop indices of phenomena hitherto considered 
largely Bis by more precise and objective meth- 
ods. 

We have warned repeatedly against taking a mystical view 
of indices, and of using them blindly. Mysticism about in- 
dices is found perhaps chiefly among those who assume that 
because an index is called by some familiar word, it neces- 
sarily measures all of the things which that word means in 
its vague folk usage. The requirements of definiteness and 
precision in an index probably preclude the possibility that 
its denotation should ever be the same asthe denotation and 
connotation of a particular folk word or phrase. When this 
fact is generally realized, we shall cease to expect indices to 
give us the same meanings as folk terms. We shall strive 
instead to develop indices that will define sociological con- 
cepts in terms of the operations involved. The usefulness of 
these definitions will be demonstrated, if at all, by the results 
they yield; by the extent to which they enable us to reach 
significant, verifiable generalizations. In this way indices, 
like other tools, although themselves the products of analyti- 
cal thought, will become in turn the instruments of more 
refined analysis. 


F. SUGGESTIONS FOR FURTHER STUDY 


1. One of the best examples of the use of a variety of 
methods in human ecology is Calvin F. Schmid, Social Saga 
of Two Cities, (Minneapolis Council of Social Agencies, 
1937). A good brief treatment of these methods will be 
found in the same author's “The Ecological Method in Social 
Research,” (Chap. 14 in Pauline Young, Scientific Social 
Surveys and Social Research, Prentice-Hall, 1939). 

2. For a remarkable but somewhat technical example of 
the usefulness of ecological data in the verification of socio- 
logical hypotheses, see S. A. Stouffer, “Intervening Oppor- 
tunities: A Theory Relating Mobility and Distance,” Amer. 
Sociol. Rev., 5:845-67, 1940. 

3. The following sources, in addition to those reviewed in 
the text, present illuminating examples of both methods and 
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results of sociometric analysis of social interaction in human 
% 
groups : A 


(a) J. L. Moreno, Who Shall Survive? A New Ap- 
proach to the Problem of Inter-human Relations (Nervous 
and Mental Disease Publishing Co., Washington, D. C., 
1934)- 

(b) F. J. Roethlisberger and William J. Dickson, Man- 
agement and the Worker, (Harvard University Press, 
1939). (This excellent monograph employs a variety of 
meéthods. See especially Chaps. 20-22.) 

(c) C. P. Loomis and D. M. Davidson, Jr., “Measure- 
ment of the Dissolution of In-Groups in the Integration 
of a Rural Resettlement Project,” Sociometry, 2:84-94» 
1939. 

(d) C. P. Loomis and D. M, Davidson, Jr., “Sociometrics 
in the Study of New Rural Communities,” Sociometry, 
2:56-76, 1939. 

(e) C. P. Loomis, “Informal Groupings in a Spanish- 
American Village,” Sociometry, 4:36-51, 1941. 

(f) Joan H. Criswell, A Sociometric Study of Race 
Cleavages in the Classroom (Archives of Psychol., No. 235» 
New York, 1939). 

(g) W. I. Newstetter, M. J. Feldstein, and T. M. New- 
comb, Group Adjustment, A Study in Experimental So- 
ciology, (Western Reserve University, Cleveland, 1938). 

(h) Helen H. Jennings, “Structure of Leadership — De- 
velopment and Sphere of Influence,” Sociometry, 1:99-143) 
1937. 

(i) Mary L. Northway, “A Method of Depicting Social 
Relationships,” Sociometry, 3:144-50, 1940. 

(j) D. C. Miller, “Evaluative Research in Group Discus- 
sion,” Sociol. and Social Research, 25:213-25, 1941. 

(k) F. S. Chapin, Contemporary American Institutions, 
(Harper, 1935), Chaps. 2-6, 16-18. 


4. For an excellent summary of sociometric work, see F. S. 
Chapin, “Trends in Sociometrics and Critique,” Sociometry, 
3:245-62, 1940. A general annotated bibliography of socio- 
metric techniques of group analysis has been compiled by 
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J. G. Franz, “Survey of Sociometric Techniques with an 
Annotated Bibliography,” Sociometry, 2:76-90, 1939. 

5. Extensive development of theoretical and quantitative 
aspects of sociometrics will be found in S. C. Dodd, Dimen- 
sions of Society, (Macmillan, 1941). For briefer treatment 
of particular aspects, see his “A Tension Theory of Social 
Action,” Amer. Sociol. Rev., 4:56-77, 1939; his “The Inter- 
relation Matrix,” Sociometry, 3:91-101, 1940; Helen H. 
Jennings, “Quantitative Aspects of Tele Relationships in a 
Community,” Sociometry, 2:93-100, 1939; E. D. Chapple, 
“ ‘Personality’ Differences as Described by Invariant Prop- 
erties of Individuals in Interaction,” Proceedings of the Na- 
tional Academy, of Sciences, 26:10-16, 1940. A more com- 
plete account of this technically difficult study will be found 
in the same author's monograph (in collaboration with C. 
M. Arensberg), Measuring Human Relations: An Introduc- 
tion to the Study of the Interaction of Individuals. Genetic 
Psychology Monographs, 22:3-147, 1940. 

6. On topological methods in psychology and sociology, 
see J. F. Brown, Psychology and the Social Order, (McGraw- 
Hill, 1936), Chaps. 3 and 4 and Appendices A and B. For 
more extensive treatment, see K. Lewin, The Conceptual 
Representation and the Measurement of Psychological Forces, 
(Duke University Press, 1938). 

7. On social indices and their uses, see P. F. Lazarsfeld, “In- 
terchangeability of Indices in the Measurement of Economic 
Influences,” Jour. of Applied Psychol., 23:33-45, 1939. See 
also B. Buell and R. Robinson, “A Composite Rate of Social 
Breakdown,” Amer. Jour. Sociol., 45:887-98, 1940. Also 
F. S. Chapin, Contemporary American Institutions, (Harper, 
1935), Chap. 20, “A Measurement of Personal Intimacy.” An 
excellent discussion of the function and importance of in-: 
dices in the social sciences with illustrations from the work 
of Durkheim will be found in Harry Alpert, Emile Durkheim 
and His Sociology, (Columbia University Press, 1939), PP- 
119-27. 

8. For a very important type of social measurement not 
treated in this book see G. K. Zipf, National Unity and Dis- 
unity, (Principia Press, Bloomington, Ind., 1941). See also 
F. H. Allport, and R. S. Solomon, “Lengths of Conversations : 
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A Conformity Situation Analyzed by the Telic Continuum 
and the J-Curve Hypothesis,” Four. of Abnorm. and Soc. 
Psychol., 34:419-64, 1939. Also A. Jenness, “A Critique of 
the Telic Continuum and the J-Curve Hypothesis,” Psychol. 
Bull., 36:531-32, 1939. Also F. H. Allport, “The J-Curve 
Hypothesis of Conforming Behavior,” Four. of Social Psychol., 
5:141-83, 1934. f 

9. On indices, see P. Lazarsfeld, et. al, The People’s 
Choice (N. Y., Duell, Sloan & Pearce, 1944), Appendix B. 
Also Magazine Marketing Service, 383 Madison Ave., N. Y., 
M. M. S., County Buying Power Index. 


CHAPTER XI 


FIELD WORK: THE INTERVIEW AND THE 
SOCIAL SURVEY 


Let us assume, however, that we are now satisfied that we know where to 
find, and have set up a sample truly yi robs of the country as a whole, 
and that we have found words and phrases which clearly and without bias 
convey our meaning. We are still confronted with the enormous job of get- 
ting the actual field work done... No matter how skillfully drawn the ques- 
tionnaire is, it can fall down if it is sent to fifty deserving college juniors p 
eeM who have no feeling of the great importance of their part of the 
work and who have not been thoroughly trained... We are often asked what 
kind of people make the best interviewers and find it very difficult to “type” 
them. They must all be honest; they must all have personalities sufficiently 
attractive to let-them sell respondents on the idea of giving an interview ; they 
must realize the importance of letting no inflections creep into their voice 
which might influence answers; and of course, they must be possessed of a 
good strong pair of legs! We are materially aided, however, in our efforts 
to secure competent people W the fact that the work itself is extremely in- 
teresting and appeals to intellectually curious and intellectually honest peo- 
ple. Until last year one of our interviewers was one of Hollywood's foremost 
character actors who stated that our work gave him a better opportunity to 
study character types than anything else he did. Another was the vice- 
president of a nationally-known manufacturing concern who liked to spend 
all of his week-ends interviewing because of his genuine interest in sociology. 
— Elmo Ropers 


A. INTRODUCTION 


The major theme of this book has been the need for the 
accurate observation and recording of first hand data and 
their reduction to valid generalizations. We have treated 
(1) the importance of definite units and objective terminol- 
ogy (Chapter III); (2) the determination of the scope and 
purpose of our investigation; that is, the formulation of a 
working hypothesis and the selection of a representative sam- 
ple (Chapters IV and V); and (3) the construction of me- . 
chanical devices in the form of schedules of various kinds to 
help us objectively to observe, record, and compare data 
gathered from original sources (Chapter VI to X). There re- 
mains the consideration of the actual field work. 

We saw in Chapter VI that data may be secured either from 
documentary sources or from field work. Field work we 
classified as of two principal kinds: (1) direct observations 


1“Sampling Public Opinion,” Jour. Amer. Statis. Asso., $5:329-3% 1949. 
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of any social phenomena, including verbal behavior ; (2) the 
oral or written testimony of persons, when our interest is in 
the situations represented by the verbal responses rather 
than in the language behavior itself. When our interest is 
primarily in the verbal behavior per se, its observation be 
longs, of course, to the first category. It is recognized that 
the pursuit of documentary sources may involve field work, 
and to the extent that such is the case, the technique of field 
work would apply also to research from documentary sources. 
But since the technique of securing data from documentary 
sources has been relatively highly developed by the histo- 
rians, and has been discussed in a previous chapter (Chapter 
IV), we shall confine our efforts in the present chapter to a 
consideration of the technique of gathering social data by 
direct observation of the behavior, verbal or otherwise, of 
persons and groups, and of their significant environmental 
conditions. 

__ The best way to learn the technique of efficient field work 
is to participate in an actual field project under a capable 
director. The next best way is to study the printed reports 
of successful field investigations (see suggestions at the end of 
this chapter). The nature of field work will vary so greatly 
with the type of investigation as to make any detailed treat- 
ment more s inapplicable to some specific situations. 
We shall not attempt, therefore, to do more than designate 
the principal problems likely to occur in all interviews. 
‘Mote detailed consideration of field work must be left to 
specialized treatises on surveys and interviewing. 

The technique of field work may be considered from three 
principal points of view: (1) the authority and backing of 
the investigator ; (2) the organization of a field staff; and 
(3) the interview. 


B. THE AUTHORITY AND BACKING OF THE FIELD WORKER 


Since field work of any kind involves encroachment upon 
the time, premises, and information of other people, or upon 
the community as a whole, the first consideration is to justify 
such encroachment. When the field work is of an official 
character and is undertaken by the proper authorities in the 
fulfillment of their duties, this constitutes in itself the re- 
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quired justification and authority. Such is the case, for ex- 
ample, in the taking of the Federal or State Censuses, the 
inquiries of the local assessor, tax collectors, etc. But most 
field work primarily for research purposes is not of this type. 
There is, to be sure, a great deal of research or investigation 
carried on by official bodies — commissions, survey commit- 
tees, etc. — which derives its authority from the government, 
and thus has the same justification and authority as the field 
work conducted by any other regularly constituted officials. 
But allowing for the research carried on by such bodies, there 
is still a great deal carried on by private individuals and pri- 
vate organizations which requires the cooperation of the com- 
munity for its successful execution. In the physical sciences, 
for the most part, we seize our material, whether it be the 
lower animals or inanimate objects, and study it without 
consulting the material itself. But in social research, the 
passive if not the active participation of the individuals or 
the community to be studied is frequently essential. Herein 
lies, of course, one of the principal obstacles to social re- 
search as compared with research in the physical sciences. 

The best method of securing the cooperation of the indi- 
viduals or the community to be studied is to secure for the 
investigation the approval of leaders in the community. 
This might include such approval by the,duly constituted 
authorities and thus make cooperation mandatory. More 
frequently, however, we must rely on the less overt social 
pressures—the recommendation of the priest, the chute » 
the civic organizations—for a friendly attitude toward th 
research on the part of the persons touched by the inquiry. 
The first step, then, is to determine which persons’ coopera- 
tion will be required in the proposed field work and, sec- 
ondly, to secure for the project the approval of the organiza- 
tions, officials, or leaders to whose judgment the persons 
concerned are accustomed to defer. The final cooperation . 
of each individual concerned must, of course, be achieved by 
the technique of the interview, the letter, or whatever form 
of approach is employed. But the approval of the leaders 
of the community will materially facilitate the personal ap- 
proach. 

The approach and backing of important social and civic 
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agencies in the community is important not only from the 
standpoint of securing community cooperation, but the finan- 
cial support of such agencies is frequently required in any 
field work which is at all comprehensive. While the number 
and types of organizations whose backing is desirable vary 
greatly with the scope and purpose of the projected field 
work, it is usually desirable to secure the backing of more 
than one agency, as not all people in a community may be 
closely associated with or pay much heed to any single agency. 
For field work involving the whole community, the backing 
of a committee representing all the important interests and 
activities in the community is most desirable. It insures an 
avenue of approach and a basis of appeal to everyone. In an 
age of high-power salesmanship, people are often properly 
wary of giving even apparently harmless information to a 
stranger, regardless of the explanation the inquirer may offer. 
The approval of an organization in which the informant has 
confidence becomes, therefore, an introduction for the field 
worker and an assurance to the informant that he may co- 
operate freely without the danger that the information which 
he gives will be used to his disadvantage. The publicity 
which can be secured for the investigation (when this is de- 
sirable), by having it sponsored by prominent representa- 
tives of important civic and social agencies, also prepares the 
public for the field work and greatly facilitates the task of 
the field worker. Whether financial assistance is sought or 
not, therefore, backing for any investigation involving field 
work should be secured from the organizations which com- 
mand the confidence of the community. We are speaking 
here of “community” studies in which the cooperation of 
large numbers of officials and other members of the unit 
studied is essential. Private commercial studies, polls, etc. 


are, of course, successfully carried out without any official or 
group sanction. 


C. THE ORGANIZATION OF A FIELD STAFF 
Before the regular field work can be organized it is neces- 
sary for the director of the investigation, in addition to his 


general plans, to familiarize himself with the fields and the 
sources of information. Only in this way can he determine 
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the size and type of staff necessary as well as what persons, 
groups, or organizations are to be canvassed by his field 
workers. Who may be qualified field workers and the de- 
gree of organization and supervision required depends en- 
tirely upon the type of field work to be done. In many local 
surveys of general community conditions, volunteer field 
workers who are especially interested in certain aspects of 
community life, such as teachers, social workers, and stu- 
dents in special fields, may prove earnest and capable field 
workers. On the other hand, an inquiry of national scope, 
like the Federal Census, requires a large number of workers 
not previously trained, and in such cases the most careful 
organization and supervision becomes necessary, if the re- 
sults are to be complete, comparable, and reliable. While 
the requirements of a staff will vary greatly according to the 
scope and nature of the inquiry, we may consider certain 
general principles which may be said to apply in greater or 
lesser degree to the selection, instruction, organization, and 
supervision of any field staff. 

If the inquiry is of a specialized nature so as to require 
the services of specially trained field workers, these would be 
selected as specialized talent is selected for any other work. If 
the field work can be performed by persons without special 
training, provided they are carefully instructed, and if the ne- 
cessities of the investigation require the utilization of such per- 
sons, the most important criterion of their desirability is that 
they should be able and willing to follow instructions ex- 
plicitly. This does not mean that persons who have this me- 
chanical ability will necessarily be desirable field workers. 
Most field work requires resourcefulness, initiative, and tact, 
as we shall see in our discussion of methods of interviewing. 
At the same time, persons who as field workers feel free to 
follow their own inclinations on what is to be achieved, are 
very difficult to manage, and their work is frequently worth- _ 
less from the standpoint of the investigation as a whole. The 
essence of field work data, as of all other scientific data, is 
that they should be accurate and comparable. This end can 
be achieved only by a certain uniformity of procedure. When 
untrained workers must be employed, instructions as specific 
as possible, designed to cover every contingency that may 
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arise, should be supplied to each worker. Carefully prepared 
schedules with detailed instructions for filling them out are 
the best devices for securing uniformity and accuracy in the 
collection of field data. 

Whenever possible, the employment of immature, un- 
trained, and tactless field workers should be avoided in so- 
cial investigation. In addition to the unreliability of the 
data collected by such persons, it is important not to create 
an unfavorable attitude on the part of the public toward the 
particular project and toward field investigation in general. 
As has been pointed out above, the success of much social 
research depends largely on the cooperative attitude of the 
groups studied. This cooperative attitude may be sacrificed 
by incompetent field workers, not only in a particular project, 
but for future projects as well. The practice of sending high- 
school and immature college students indiscriminately into 
the field to gather original data for term papers is to be de- 
plored. It often becomes the basis of deep prejudice against 
all field workers in a community. Untrained field workers 
should not be employed without thorough instructions and 
close supervision. 

The organization and supervision of a field staff, again, 
depends upon the scope of the investigation. The following 
account of the Federal Census of 1940 will give some idea of 
the organization required for this, the largest of all field 
inquiries : 2 z 


More than ever before the 1940 Census has been a cooperative 
enterprise on a Nation-wide scale. In a very real sense the decen- 
nial census of the United States is a national undertaking, the 
Bureau of the Census being but the instrumentality for the 
census. Conferences, voluminous correspondence, and in some 
cases public trial, determined the questions to be asked and the 
tabulations to be made. Through the cooperation of newspapers, 
radio stations, motion pictures, trade associations, local citizens 
committees, and other groups too numerous to mention, the 
Nation was informed about its census. And the Nation re- 
sponded with a good will to give reports for more than 131,000, 
000 individuals, 36,000,000 Swelling units, 6,000,000 farms, 3 
000,000 places of business, 180,000 factories, and 30,000 mines 
and oil wells. 


? Annual Report of the Secretary of Commerce, 1940, pp. 37, 40-42, 59 
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Every census has four distinct phases ; (1) planning and prep- 
aration, (2) enumeration, (3) compilation of the data, and (4) 
preparation of final reports. . . 

oe adequate for the gooo employees required for the decen- 
nial census and other Bureau work at the peak during 1940 and 
1941 was a pressing problem. In March 1940, the three largest 
divisions of the Bureau concerned with the decennial census, 
population, agriculture, and mechanical tabulation, were moved 
into the new Census Building, just completed. Although this 
modern building provides 396,000 square feet of space, the 
Bureau has had to rent additional space outside the Commerce 
Building for certain of its functions, and will utilize the new 
building for a double shift during the peak of the decennial 
census work... The following number of employees is approx- 
imately correct for the peak of field operations on the decennial 
census: total employees in field administrative offices (includ- 
ing supervisors, clerks, squad leaders, and interpreters), 7361; 
enumerators for population, agriculture, and housing, 101,916; 
enumerators for business and manufactures, 6396 ; and enumera- 
tors for irrigation and drainage, 261... 

Employee recruiting and training. Nearly 130,000 persons 
will work in the field or in the Bureau’s Washington office on 
some phase of the Sixteenth Decennial Census. mphang has 
been given both to the recruitment and the training of this army 
of temporary employees. y 

During September, October, and November, 1939, 180 candi- 
dates for the position of area manager were given an 8 weeks 
training course in Washington. These classes were taught by 
census experts in their specific fields, and included instruction in 
administrative and personnel procedures, as well as in schedule 
forms and instructions. Objective tests on each census subject 
were given and the grades on these tests were correlated with the 
grades made on general intelligence tests, and together these 
grades formed part of the basis for the hiring and placement of 
these workers, 

During December each area manager held a 1 week training 
program for his district supervisors using detailed training in- 
structions prepared in the Washington office. The district super- 
visors in turn held from 1- to 4-day training classes for the census 
enumerators just prior to the beginning of their work. Again, 
objective tests were prepared for the selection and placement of 
enumerators. 

Periodic correspondence instruction was given to the area and 
district supervisors from January through March. Regional 
training classes were held in nine cities for special instructions on 
population, agriculture, and housing during ebruary and March. 

Sound motion pictures were made in Washington and dis- 
tributed to district supervisors to assist them in training enumera- 
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tors. This visual method of training was an innovation of the 
Sixteenth Decennial Census and was perhaps the largest visual 
training program ever given in so short a period, Instruction 
manuals, illustrative examples of completed schedules, and ob- 
jective tests were prepared to assist the district supervisor in the 
training of enumerators. 

An intensive program is being conducted for temporary office 
employees to train them in editing, coding, and machine tabula- 
tion. 

Census publicity. A Division of Public Relations was organized 
in August 1939 under specific authority of Congress to plan and 
execute an educational campaign for enlisting Nation-wide co- 
operation in the decennial census. 

Operating through various media of public expression — news- 
papers and periodicals, radio, motion pictures, local committee 
organizations, public officials, and the Bureau's field force —an 
intensive campaign was conducted in advance of and during the 
enumeration. More than 2000 cooperating local committees 
sponsored by chambers of commerce and State, county, and 
municipal officials assisted this program in their localities. 

School officials, church leaders, civic groups, and fraternal or- 
ganizations throughout the country also gave their assistance to- 
ward insuring the completeness of the census. Valuable con- 
tributions in services were received from county farm agents, 
home demonstration agents, and other leaders in the rural sec- 
tions of the country. Other agencies of the Federal Government 
also cooperated in pening the word along through their local 
offices everywhere. The 40,000 postmasters throughout the Na- 
tion gave space on lobby bulletin boards to posters and other 
expressions of census information. . . 

ditors of more than 10,000 daily and weekly newspapers, and 
of more than 1600 general magazines, business and professional 
periodicals, agricultural publications, and organs of trade associa 
tions and commercial organizations also cooperated in giving 
space to explanatory articles about the census as well as editorials 
urging the fullest cooperation on the part of their readers. 

he Office of Education contributed much to the success of 
the radio program as did, of course, the various radio broad- 
casting chains, commercial program sponsors, and hundreds of 
individual radio stations. Fimilarly, the cooperation given by 
several thousand individual motion picture theater operators and 
by national news-reel companies is to be credited as a contribu- 
tion of major importance. 

_ Other agencies and groups, too numerous to detail in considera- 
tion of space limitations, contributed also—as, for instance, news 
Services and press associations, Washington newspaper representa 
tives, free-lance writers, volunteer speakers, library officials, hea 
of individual business organizations, and so on. The Bureau 4 
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had the benefit of consultative service by technical experts on the 
various media of publicity. . . 

Special provision was made for the enumeration of transients 
throughout the country. As the census enumeration is carried on 
over a period of several weeks, transients may be missed by the 
enumerator if they move during the enumeration period. In 
order to avoid this contingency, April 8 was set aside as the day 
when the usual places of residence of transients in all cities would 
be visited by enumerators. Special blanks were used to improve 
the enumeration of W.P.A., C.C.C., and N.Y.A. workers. Absent 
family schedules and nonresident schedules were used more ex- 
tensively than in previous censuses, and were mailed directly to 
Washington for allocation to the proper enumeration districts, 
A card for new occupants was left in all vacant dwelling units to 
insure the enumeration of persons moving during the census 
period. The use of these supplemental forms was aimed directly 
at securing a more complete count of transient population than 
in previous censuses. It has been necessary to check these various 
forms against the names of the population schedules to avoid 
duplicate enumeration. It is evident that a good enumeration 
of transients has been secured. 

Trial census. A special census of St. Joseph and Marshall 
Counties, Ind., was authorized by the Secretary of Commerce to 
be taken as of August 14, 1939. This census, a preview of the 
decennial census of population, proved to be very helpful in 
providing a testing ground for the schedules, auxiliary forms, 
Instructions, and procedures planned for the decennial census 
and indicated some necessary changes in the schedules and 
procedures, Two innovations of the trial census were the use of 
objective tests as a means of testing and selecting the enumerators 
after a period of census training and the employment of squad 
leaders as supervisors of from 10 to 20 enumerators. Both of 
these innovations were adopted for the Sixteenth Decennial 
Census. 

This trial census also provided real data for training office 
employees before the population schedules from the regular cen- 
sus were received. Preliminary editing and coding instructions, 
card forms, tabulations, and even table forms for the final census 
reports for 1940 were developed well in advance of any previous 
census on the basis of this substantial “census preview.’ 


The precautions taken to secure a uniform type of field 
work in a project of much lesser scope is illustrated by the 
supervision of field work connected with the scoring of the 
health departments of eighty-six cities by the American Child 
Health Association discussed in Chapter IX. 
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Great pains were taken to insure a uniform interpretation of 
the schedule by the surveyors. Each surveyor, after all pants 
explanations before beginning, was =e imp on his first sur- 
vey by the senior surveyor who was chiefly responsible for the 
preparation of the schedule. The meaning of the questions and 
the type of answers expected were thoroughly discussed. This 
was done to avoid misinterpretation and consequent lack of 
comparability of the answers. In addition to this precaution each 
report was subjected to critical inspection, and omission and non- 
specific entries were taken up directly with the surveyors by cor- 
respondence. The value of this uniform supervision of each 
surveyor was unmistakable.’ 


Methods of handling cases of refusal and failure to con- 
tact informants are suggested in the following account : 


Both preventive and corrective measures were adopted to mect 
the problems of refusals and of failures to contact families. In- 
complete schedules and refusals are largely due to the approach 
used by the field agent, so the best preventive measures are di- 
rected at a careful training of agents and control of assignments. 
Calls at night to interview someone other than the person refus- 
ing, eliminating those agents who turn in too many refusals, shift- 
ing unsuccessful agents from the more difficult districts to the 
sections where families are more cooperative, reduced refusals. 
The agents in the present study were instructed to make at least 
two calls upon families before turning in the cards with partial 
or no information. Assignment supervisors were instructed to 
assign cases for a third visit to different and more skilled agents. 
Another control of the work of the agent was exercised throug 
check interviewing which made it possible to discover agents who 
reported vacancies when families refused information. It was 
claimed by some supervisors that emphasis on speeding up Pr 
duction would increase the number of incomplete and unac 
ceptable schedules. This was not found to be the case, however. 
In fact, the most productive agents also had the fewest refusals, 
and the most aceurate schedules. A third important preventive 
measure consisted in securing favorable presentation of the 1m- 
vestigation in the press. 

Corrective methods in assuring the collection of the random 
sample as selected, consisted in measuring the biases introduc 
by refusals, incomplete schedules, and cases where families could 
not be contacted. The first analysis of any possible biases was 
made by finding the proportion of refusals an uncontacted cases 
in each district. Early reports indicated that a significant number 


2 American Child Health Association, A Health Survey of 86 Cities, (New 
York, 1925), p. 9- 
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of families in the wealthier neighborhoods were not at home or 
refused to give the information requested. During the hottest 
weeks of the summer months as many as 50 percent of the famil- 
ies in the better districts of the large cities had left the city and 
moved to vacation places. It was possible to revisit these families 
upon their return to the city in the fall after the early analysis 
had revealed the need for revisits.* 


These examples will illustrate the types of organization 
and supervision of field work in cases where the data sought 
are specific and objective. Methods of organization and su- 
pervision must, of course, be modified to suit the particular 
investigation. Field work of a preliminary, exploratory, or 
“detective” nature cannot be as rigidly detailed in advance 
as in the above cases. But such exploratory field work must 
be regarded as preliminary to scientific surveying. It may 
produce clues and provide the basis for the systematic study 
of the situation. In such work the greatest possible freedom 
must be allowed the worker. But for the collection of data 
to form the basis of scientific generalization, the require- 
ments of objectivity, uniformity, and comparability compel 
us to rely on more formal organization and supervision. 
Eternal vigilance in the form of constant checking is the 
price of accurate data in all scientific observation. 


D. THE INTERVIEW 


A complete account of the principles and practices of in- 
terviewing would involve a lengthy treatise on the technique 
of salesmanship, newspaper reporting, legal examination, in- 
cluding “third degree” methods, case work, and psycho- 
analysis, as well as an exhaustive consideration of the prin- 
ciples of psychology. Any one of these subjects is in itself 
a suitable topic for a volume. For our present purposes, 
however, we need consider only those aspects of the subject 
which relate to the more external technique of any field 
work involving interviewing. For a detailed consideration 
of the social and psychological processes involved, and the 

4E. H. Schoenberg and M, Parten, “Methods and Problems of Sampling 


Presented by the Urban Study of Consumer Purchases,” Jour. Amer, Statis. 
ASSO., 32:317-18, 1937- 
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specific technique to be employed, the reader must be re. 
ferred to technical treatises on the subject. 

The interview may be of significance to social research in 
two principal ways. (1) It may be simply a method of secur- 
ing data from a secondary source when our interest is mainly 
in the data that are conveyed to us by the informant instead 
of in his verbal behavior in conveying them. Thus the census 
taker or assessor inquires about certain objective facts as a 
short-cut method of securing them. As a rule he ignores the 
attendant behavior of the informant. When the information 
transmitted is our primary interest, the same principles of 
criticism that apply to any secondary source are applicable to 
the interview. (2) The interview may also be of importance 
to social research as a laboratory study of verbal behavior. 
In this case our primary interest is in the verbal responses, 
as such, to stimuli of various kinds. From such responses, 
the psychoanalyst, for example, makes inferences and draws 
conclusions. 

In most inter¥Views we are interested in both aspects of the 
process. Both types of interview involve to some degree es- 
sentially the same techniques of approach—securing ap- 
pointment, establishing rapport, inspiring confidence, and 
releasing mental mechanisms. The researcher is interested 
in the objective data secured from the interview, such as in- 
come, the number of children, their ages, etc, and also in 
the personality of the informant — his attitudes : prejudices, 
likes, and dislikes—as revealed by his verbal behavior, in- 
cluding the subtle gestures that accompany it, such as facial 
expression, tone of voice, and so forth. As in any scientific 
observation, research by interview consists of selecting out 
and recording those aspects of the total situation in which 
we are interested. 

1. Selecting persons to be interviewed.—The first prob- 
lem which presents itself in research involving interviewing 
is the selection of persons to be interviewed. How definitely 

* Some of these techniques are treated in Chaps. VII and VITI. Perha the 
best treatment of interviewing from the seandpolat of the social worker 
Pauline Young, Interviewing in Social Work, (McGraw-Hill, 1935). Seea 


E. S. Bogardus, The New Social Research Miller, Los Angele 
1926), Chaps. IV, V. ee as a 
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such persons can be selected in advance depends upon how 
well-defined is the object of our inquiry and how completely 
the sources of such information are known. If we are inter- 
ested in the wage scale of a certain industry, the responsible 
heads of the various establishments in this industry at once 
suggest themselves as the persons from whom to secure this 
information. The social worker seeking items of general in- 
formation about a case consults neighbors, employers, rela- 
tives, teachers, and public officials. She may get the infor- 
mation from any, all, or none of these.* The problem of 
whom to interview, therefore, is a question of deciding who 
will yield most easily the desired information with the great- 
est reliability. Our familiarity with the field in which we 
are working will determine our ability to select such sources 
without employing the trial-and-error method. 

In any case, all of the principles of valid sampling which 
we discussed in an earlier chapter apply in the selection of 
persons to be interviewed. Unplanned selection of inform- 
ants, which is not to be confused with random selection in 
the statistical sense, usually does not yield a representative 
sample. Much interviewing by journalists is of this type. 
The readily available and the garrulous person gets an un- 
due degree of attention in such interviewing. Indeed, the 
validity of results secured through interviews, as was also 
noted in connection with our discussion of mailed question- 
naires, depends on the representativeness of the persons who 
cooperate. We saw in Chapter VII that the people who do 
not answer questionnaires frequently differ in important ways 
from those who do.” The interview, as contrasted with the 
mailed questionnaire, is sometimes assumed to avoid this 
difficulty. But obviously the interviewer will also fail to 
reach or secure the cooperation of some persons selected to 
constitute the sample. Do the people thus missed differ in 
respects important for the particular investigation from those 


6 For a discussion of Principles Governing the Choice of Sources in relation 
to social case work, see Mary E. Richmond, Social Diagnosis, (Russell Sage 
Foundation, 1917), pp. 169-76. 1 

7A method of analysis for estimating the maximum effect of missing cases 
not found, unknown cases, and refusals is proposed by S. A. Stouffer and 
P. F. Lazarsfeld in The Family and the Depression, (Social Science Research 


Council, Bull. 39, New York, 1937), p- 176. 
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who are suecessfully interviewed? A recent study of this 
subject revealed the results shown in Table VII: ° 


DELTE” m 


TABLE VII 


i 
Some Personal Characteristics on which Refusals, Cooperators, and y 
Unavailable Groups for an Interview Differed ’ 


Personal Characteristics Cooperated | Unavailable 
or 
70 
Female 


Low economic status 
Low educational level 


Industrial workers 
Housewives 
20 to 24 years of age 
25 to 44 years of age 


This study indicates that “refusals tended to come most 
frequently from housewives and people of low educational - f, 
status. The unavailable group was predominantly male, — 
included more industrial workers than other groups, an 
tended toward low economic levels. Refusals were charac: _ 
terized by individuals of intermediate ages, while more young 
individuals were unavailable for interviewing.” ° | 

2. Securing appointment for interview.—When the per- 
sons to be interviewed have been selected, the next step 8 
to secure appointments with these persons. Securing @ denii 
nite appointment is desirable from several points of view; | 
it saves the time of the field worker by insuring him that he 4 
will be received, and by enabling him to use his time most 
economically; it avoids the antagonistic attitude which 1$ 
frequently, and justly, aroused by the attempt of anyone tO 
“break in” when the prospective informant is engaged in 
other work. Ordinarily, a request by letter for an intervieW 

$ Hazel Investi 
gator ?” Jour. Applied. Re cores’ Pag oy Ay tased on attempt 
to re-interview a tative sample of 1800 (out of an original sample © 
De oaiiy 2800) by the Office of Radio Research, Columbia Uni ths r 

panel was interviewed approximately once a month for the six mon 
before the election of 1940, See Elmo > “Checks to Increase Polling 


Accuracy,” Public Opin. Quart, 5:87-90, 1941. f 
Gaudet and Wilson, > yeh sy 


_ 
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is preferable to a request by telephone, as om gaa per- 
mits a more adequate statement of the purpose for which the 
interview is desired, the auspices under which the inquiry is 
undertaken, and other matters designed to insure the grant- 
ing of the interview and a favorable reception. There are, 
of course, circumstances under which so complete a state- 
ment of the purpose of the interview would insure its not 
being granted, or cause a “preparation” on the part of the 
person to be interviewed which might materially destroy the 
value of the information secured. The technique of secur- 
ing an appointment must depend on the nature of the infor- 
mation sought, the personal idiosyncrasies of the person to 
be interviewed, and the degree of rapport between him and 
the interviewer, or between the interviewer and the auspices 
under which the interview is sought. Sometimes interviews 
may be secured through the influence of friends or associates 
of the person with whom the interview is sought. Some form 
of preliminary arrangement for the interview is desirable 
when convenient, but much successful marketing and other 
research, such as public opinion polls, is conducted without 
special arrangement for interviews. 

3. Favorable conditions for interviewing.—For most inter- 
views privacy is highly desirable, though the importance of 
this varies with the nature of the interview.1° When personal 
data are sought, the presence of other people is almost always 
inhibitory. For such interviews, the office of the interviewer 
acts as an inhibitor through the suggestion of officialdom. 
Where personal revelations are desired, places most removed 
from the social milieu of the informant yield the best re- 
sults. There are other types of interviews which can best 
be carried out in the office of the person interviewed, where 
records and corroborative material are available. The place 
and environment in which the interview takes place must 
be governed by the nature of the interview and the practical 
considerations involved. 


10 Cf. E. S. Bogardus, “The Social Research Interview,” Jour. A lied Sociol., 
10:73, 1925. For certain purposes the group interview may be the most sat- 
isfactory and economical of the field worker's time. See Bogardus, “The 
Soy Interview,” Jour. Applied Sociol., 10:372-82, e s 

11 Cf. Read Bain, “The Impersonal Confession and Social Research,” Four. 
Applied Sociol., 9:359, 1925- 
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4. Technique of the interview. — When the interview has 
been secured, its usefulness will depend largely on the per- 
sonality and skill of the interviewer. All the subtle influ- 
ences of personal suggestion now become operative. If the 
purpose of the interview is merely to get certain objective 
facts which the informant has little or no reason to hide, no 
subtlety is required. The technique to be employed in a 
given case will also be determined by the general education, 
specific training, and personality traits of the informant. It 
is well for the interviewer to inform himself as far as pos- 
sible on these matters before the interview and to prepare 
himself accordingly. But no amount of preliminary prepa 
ration will serve as a substitute for tact and spontaneous in- 
sight in a concrete situation, regardless of the type of inter- 
view. In the more difficult “confessional,” “life history,” 
or other self-revealing interviews these qualities are highly 
essential. 

Thorough familiarity on the part of the interviewer with 
the purpose of the interview may be set down as a general 
requirement for a successful interview of any kind. Only 
through such familiarity can we know whether the inform- 
ants actually get the meaning of the questions, and how we 
are to interpret their answers. Everything said in a previous 
chapter regarding the formulation of questionnaires also ap- 
plies to interviews, although the latter afford additional op- 
portunities of reformulating questions until they convey the 
desired meaning to the informant. This should not be taken 
to mean that less careful preliminary preparation is neces 
sary in interviewing. The best phrasing of questions and the 
anticipation of possible misunderstandings should be care- 
fully considered before going into the field. 

The type of interview and the kind of information sought 
will determine whether a formal schedule can be used in the 
interview. Whenever possible, it is desirable for the intet- 
viewer to memorize both questions and answers so that writ 
ing can be dispensed with during the interview. The writing 
down of answers in the informant’s presence may operate as 
an inhibitory influence in certain types of interviews- On 
the other hand, reliance on memory is subject to the poss! 
bility of inaccurate recollection of both questions and a 
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swers. This is a problem which must be decided in each 
investigation on the basis of the type of information sought 
in relation to the quality and experience of the interviewers 
available. One of the subtlest sources of error which the 
interviewer must guard against is his own bias. Rice reports, 
for example, that an investigator who was personally a pro- 
hibitionist attributed 62 percent of the applications to a 
municipal lodging house to drink and only 7 percent to 
“industrial conditions.” A  socialistically inclined investi- 
gator found for a presumably similar random sample of ap- 
plicants, 39 percent attributable to drink and 22 percent to 
industrial conditions. It is true that while the investigators 
are reported as experienced members of the staffs of social 
agencies, the questions of the schedule used were admittedly 
defective in allowing or possibly inviting the bias found. 
Other investigators under much more carefully controlled 
conditions have found, however, similar subtle influences of 
the interviewer's bias.12 We have mentioned in a previous 
connection the dangers of leading questions, the tendency 
for people to give the kind of answer the interviewer wants, 
and the necessity for extreme care in the formulation of 
questions. These conditions must be even more carefully 
observed in the interview, because all the subtle influences 
of personal contact — appearance, facial expression, ‘tone of 
voice, etc.—are here operative in addition to the more obvi- 
ous influences. 

The first step in the interview is to gain the confidence of 
the informant and to release whatever mental inhibitions may 
exist with reference to the interview. There are a great 
variety of devices in common use for this purpose. Ordi- 
narily the interviewer will first introduce himself and refer 
to the local or other backing of the investigation. The first 
reaction of the interviewee is likely to be, “Here is another 
salesman.” Accordingly, it is well to emphasize (a) that in- 
formation only is sought; (b) that the informant’s help is 
sought as an act of generosity on his part, rather than as 
something from which he or humanity will greatly profit — 

12See Paul F. Lazarsfeld, “Repeated Interviews as a Tool for Studyin 


Changes in Opinion and Their Causes,” Bulletin of the American Statistica 
Association, 2:3-7, 1941. 
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the advancement of science, etc.; (c) that the information 
will be treated as confidential (when such is the case), and 
will never be used to the informant’s disadvantage or embar- 
rassment. It is well also to start with easy, conventional 
questions which no one objects to, or with questions of spe- 
cial interest or challenge to the informant.!® To get him to 
talk at all, on any subject, is the first objective. Once he has 
started answering questions, he will answer many inquiries 
which would have been hopeless before he was “warmed up.” 
Evidences of personal interest in the informant, aside from 
the purpose of the interview, may be helpful. The inter- 


viewer who has membership-character in the group or the . 


class of people he is interviewing has obvious advantages in 
putting the informant at ease. As Katz has said: “A poorly 
dressed manual laborer with ungrammatical speech will have 
difficulty in interviewing along Park Avenue. Similarly, the 
Harvard graduate with a Harvard accent may not win the 
confidence of the miners in the Pennsylvania coal fields.” “ 
To evidence interest in the informant’s occupation ; to show 
one’s own familiarity with his work; to refer to important 
friends of the informant as if one were quite well acquainted 
with them ; to tell of one’s own experiences or problems and 
ask the informant’s advice or reaction to them—these are 
illustrations of the scores of devices employed to get an in- 
formant “‘started.” 

Indirect approaches may sometimes yield more valuable 
results than direct requests. Likewise, impersonal and dis- 
interested demeanor will do more to secure frankness than 
will direct urging to be frank. When there is reluctance to 
communicate important facts, the interviewer may stimulate 
communication by appearing to be misled in a direction dis- 
advantageous to the informant. This will sometimes bring 
forth the desired information in the form of a correction. 
The extent to which these devices are useful is; again, deter- 
mined by the type of informant. ‘There are objectively 
minded people who are glad to give information for the sake 
of truth and the advancement of research, and who would 

18 See Chap. VII, 


. 189-90. g ; 
4D. Katz, “The Pubic Opinion Polls and the 1940 Election,” Public Opin: 
Quart., 5:52-78, 1941, p. 69. 
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resent any but the frankest and most direct approach. The 
technique of the interview must be modified to suit each 
occasion.?® 

Bogardus summarized the methods of interviewing and 
the types of informants as follows: 


Experience indicates that a witness does best when (1) allowed 
to use a narrative form, (2) when least interrupted, (3) when en- 
couraged to follow a time order, and (4) when answers are not 
suggested by questions, but when questions are used simply to 
release narratives. Persons display a variety of ways in “witness- 
ing,” and hence the examiner must change his methods accord- 
ingly. (1) The rambling witness requires a close and catechetical 
examination. (2) The dull and stupid witness calls for inex- 
haustible patience on the part of the advocate. (3) The timid and 
self-conscious witness needs to have his attention taken off the ef- 
fects regarding himself or others that he imagines are going to be 

roduced. (4) The bold and zealous witness is easily led astray 

y his own exaggerated egoism and is usually a dangerous witness 
from the standpoint of truth. (5) The hostile witness is of value 
to truth only in indirect ways, by studying his negative reactions 
— otherwise, he is to be avoided. (6) The lying witness may be 
(a) an innocent liar, (b) a careless liar, or (c) a wilful liar. The 
classification suggests in itself the attitude to take toward him. 
(7) The flippant witness is to be circumvented, to be treated as a 
“wild animal ready to tear you if she should get near enough. 
Her “frenzied exuberance” is to be encouraged, for thereby she 
will “give herself away.” (8) The dogged witness is to be ap- 
proached by “getting little answers for little questions,” and to get 
him loosened up, and accustomed to talking. (9) The hesitatin; 
witness is to be allowed to take his time. His weighing an 
balancing of answers needs to be studied. Sometimes he may 
hesitate simply in order to be “scrupulously accurate.” (10) The 
nervous witness is to be dealt with gently and to be encouraged. 
(11) The humorous witness may have his good humor appealed 
to, and may be as he is, a jolly good fellow. (12) The cunning 
witness is not cunning so much as he thinks he is. He is to be 
met with cunning and needs to cea a era goen 
up.” (13) The canting hypocrite is likely to believe in “religio 
and ni Bis Soodness,” Td to approach a down-right lie by 
shirking it and by.using phrases that disclose his weaknesses. (14) 
The positive witness can be led into contradiction of what she or 
he has already said.1* 

om 5 , “The Social Research Interview,” Jour. Applied 
Sociol, enka ee: ane ako E. T. Krueger, “The Technique of Secur- 
ing Life History Documents,” Jour. Applied Sociol., 9290-98, 1925. 

ls E, S. Bogardus, “Methods of Interviewing,” Jour. Applied Sociol, 9: 
459-61, 1925. 
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As stated at the outset of this section, it is not possible 
here to enter into a detailed consideration of the intricate 
interstimulation and response which are the structure and 
content of the interview. The fact is that there are very few 
scientific data available on the subject, although research in 
this field lies at the very foundation of sociology." Like all 
other techniques, skilled interviewing and social work gen- 
erally must wait upon the development of the basic sciences 
which underlie these techniques. Only from such founda- 
tions can a scientific technique be developed. 

Skillful interviewing will remain an art. As in the case of 
other arts it will become a more refined and communicabie 
technique in proportion as the scientific principles under- 
lying it are understood and applied. These principles must 
in turn evolve largely from the experience of successful in- 
terviewers. Pauline Young has summarized that experience, 
especially as it relates to social case work, in the following 
practical maxims : 18 


I. Preparation for the Interview : 


1. Understand why the interview is held. 

2. Be prepared to be viewed by the interviewee and to 
answer any question he may wish to ask. 

3- Gather all ible clues and concentrate on each one 
to see which might be best utilized toward a successful 

MS aber to the interviewee. 

4. If time is limited and problems are pressing, formulate 
tentative objectives for your interview or chart its course 

as far as possible. 

Keep in mind the unique problems of each case and 

relate them to the total social situation in which the 

interviewee finds himself, as far as is possible under the 

circumstances, 

6. “Size up” your interviewee from whatever clues you May 


possess, but keep your mind flexible when you are mM 
contact with the interviewee. 


3 The basic behavior involved has been approached in a detailed quan 
titative way by E. D. Chapple, “Measurin Haman Relations: An Introduc: 
tion to the Study of the Interaction Individuals,” Genetic Psychology 
Monographs, 22:3-147, 1940. For a general bibliography of the subject, 
see Pauline Young, Interviewing in Social Work, (McGraw-Hill, 1936); For 
a keen analysis of some psy; i 1s of interviewing, especially 2$ 
related to market research, see The Technique of Marketing Research, 
(McGraw-Hill, 1937), Chaps, 3 and 4, by Paul F. Lazarsfeld. 
1$ Pauline Young, op. cit, pp. 86-88. 


o 
< 


II. 


11. 
12. 
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. Make appointments whenever possible, thus showing 


respect for the client’s time. 


. Provide privacy, comfort, and agreeable surroundings. 
` Be modestly and suitably dressed for each occasion. 
. Introduce yourself to the interviewee, explain the func- 


tion of the agency you represent, and state briefly the 
purpose of visit. ; 
Extend cordial greetings. 

Observe conventionalities of the interviewee’s home and 
social group. 


The Interview Proper: 


13. 
14. 
15. 


16. 


14. 


18. 


19. 
20. 
21. 


22. 


23. 
24. 


25. 


26. 


27. 
28. 


29. 
30. 
gi. 
32. 


Show interest in everything the interviewee wishes to 


tell you. Make the interview “his moment.” 

Be at ease with the interviewee, thus helping him to be 
at ease with you. 

Adopt a leisurely pace to give the interviewee the feel- 
ing that you consider his situation seriously. 

‘Allow sufficient time to learn the facts which will enable 
you to understand the problems involved. 

View all problems confronting the interviewee from his 
point of view. 

Make suitable allowances for your preconceived notions 
about his problems, for your biases and prejudices. 
Meet the interviewee on his own terms. 

Gain, keep, and deserve the interviewee’s confidence. 
Identify yourself with the interviewee through similar 
experiences, piper of view, mutual likes and dislikes. 
Gain sympathetic insight into problems confronting the 
interviewee. 

Assign social status to the interviewee, remembering 
that status is one of his most priceless ossessions. 

Ask only questions which you would not resent an- 
swering about yourself under similar circumstances. 
Ask questions which are easily understood. 

Ask questions which do not imply their answers. 

Take care not to sound insinuating or impertinent. 

In asking questions be frank and straightforward rather 
than cunning. K , 
Avoid “getting around” the interviewee or extracting 
information against his will or knowledge. 

‘Allow the interviewee to tell his story in his own way 
and to take his time in telling it. 

Allow the interviewee to think for himself and give 
him time to think his situations through. 

“Make your minds meet”; that is, be sure that you 
understand what the interviewee says, what he wants, 
and what his culture complexes are, and he yours. 
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. Listen with interest. s ; f 
. Follow up every important clue of interest to the inter- 


viewee. 


. Seek to understand source, cultural setting, and devel- 


opment of the interviewee’s behavior patterns. 
eet objections in a way which is satisfying to the 
interviewee. 


. Avoid ordering-and-forbidding techniques. 
. Allow for face-saving. TE 
. Lessen tensions by assigning status, by soliciting help, 


by complimenting good performance, etc. 


Ill. Controlling the Interview: 


40. 


41. 
42. 


43- 


Check the story in your own mind to see if there are any 
inconsistencies. 

Deal with inconsistencies as misunderstandings. 
Remember the saying that truth is told to friends and 
lies to enemies. y 
Eliminate opportunities which an interviewee might 
seize upon to deceive you. 


44. If you sense that an interviewee is going to falsify, start 


out by stating the facts of the case as far as they are 
known to you. 


IV. Closing the Interview: 


45- 
46. 
47- 
48. 
49. 


50. 


Close the interview before you have worn out your 
welcome. 


Close the interview when the interviewee is at ease emo- 
tionally. 

Close the interview when the interviewee has caught the 
desire to manage his own affairs, 

If subsequent interviews are necessary, close when you 
still have something important to talk about. 

At the close of the interview ask if there is anything else 
your interviewee wishes to talk about. “What else 
poule yon care to tell me? What didn’t we touch 
upon 


est the success of your interviews. 


We have discussed above some principles of interviewing 
which may be said to have general validity, but which are 
primarily adapted to research rather than therapeutic Or 
other special purposes as regards the informant. Specialized 
forms of the interview such as concern legal, medical, psy- 
choanalytic, and general social work purposes are beyond 
the scope of the present discussion. Two special types of 
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interviewing, namely, the group interview and the repeated 
interviewing of a “panel” should be mentioned in passing. 


E. THE GROUP INTERVIEW 


For some purposes an oral interview with a group of peo- 
ple may yield information of a corroborative or “consensus” 
type which is perhaps more reliable than individual inter- 
views with each member would have been. The tendency of 
individuals to dramatize and to speak positively of matters 
about which they know little is likely to be restrained in the 
presence of other people who know about the subject. The 
people interviewed in a group may call each others’ atten- 
tion to aspects of a situation which receives general assent 
when mentioned, but which probably would not have oc- 
curred to more than one or two people in individual inter- 
views. Obviously, the group interview is suited to securing 
only certain types of information. When a suitable subject 
is under investigation, however, and the right kind of group 
can be assembled, the method has obvious advantages in tak- 
ing less time than individual interviewing. 


F. THE “PANEL” 


The labor and expense involved in large scale interview- 
ing, and especially in the selection of a new sample for each 
inquiry, has resulted in the so-called panel technique. This 
method consists of carefully selecting a representative sample 
of population and securing their cooperation for repeated 
interviews, as a sort of permanent body of informants. The 
technique is especially useful when a great wealth of detailed 
material is sought with special reference to changes over a 
period of time. Thus, a nation-wide sociological study called. 
Mass Observation was begun in England in 1937.7° Only the 
briefest summary of the procedure is possible here : 

On Feb 12, 1937, thirt le who did not know each 
other, Shonas h O naria eien and who were of widely- 
ranged ages and income levels, wrote down everything that oc- 
curred to them on that day in as great detail as they could muster. 
At about the same time, six observers went to a Lancashire cotton 


19 See First Year's Work, by Mass Observation, edited by Charles Madge 
and Tom Harrison, with an essay on “A Nationwide Intelligence Service, 
by Bronislaw Malinowski (Univ. of London, Lindsay-Drummond, London, 
March, 1938). 
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town where “= devoted all of their time to participation in and 
description of the lives about them. Another group went to an- 
other town where the members of the former community were 
accustomed to spend their holidays. The first group of thirty, 
plus hundreds more who later volunteered to join their ranks, 
' recorded impressions, opinions, and actions ; in short, everything 
they noticed in themselves or those around them. The two 
groups that entered into the life of the community were in- 
structed to observe in much the same fashion but were also given 
instructions from time to time as well as elaborate questionnaires, 
dealing with specific factors of smoking, drinking, gambling, and 
the like. There were, then, three kinds of reports : day surveys on 
the twelfth of each month made by observers all over the country, 
area surveys consisting of detailed reports on community life by 
full-time observers, and studies of isolated factors.1% 

Strictly ore the first group is the most analogous to the 
panel, since the day surveys consist of periodic reports by the 
same group of people concerning their own opinions and activi- 
ties. The full-time observers within given communities are more 
comparable to trained interviewers or to what the anthro ologist 
calls “informants.” If the community is so small that each mem: 
ber of it is interviewed about various aspects of his life, then the 
community itself forms another panel. If surveys of specific fac- 
tors such as smoking and reading habits, on the other hand, are 

ven to a random sample of the general population as well as to 

e day-survey people and the community observers, it is ho 
that these two types of material, the unselective detailed mass of 
day surveys, and the controlled interview-questionnaire may 
eventually be used to check as well as to supplement each other. 
Such a plan depends for its success on the sifting and classifying 
of the day-survey material by trained research A and clearly- 
determined points of reference. The workers are becoming more 
and more skilled, but there is increasing need for a structural 
index of the material which will be the basis for transforming 4 
mass of subjective material into objective data.2° 


é The panel technique has also been used by such organiza- 
tions as the National Board of Motion Picture Review, The 
Woman's Home Companion™ and by the Office of Radio 
Research.** The director of the latter organization has sum- 


1* There is no mention of how pondents were 
selected or interviewed ; they may Siar ie preampa ae of the two 
communities studied, a random sample, or all three. For further criti@ 
comments on the method, see R. Firth, “An Anthropologist’s View of 
Observation,” Sociol. Rev., (London), 31:166 1939- 

2 Paul Lazarsfeld and Marjorie Fiske, ‘Panel’ as a New Tool for 
yea Opinion,” Public Opin. Quart., 2:596-612, 1938. 

id., pp. 5-10. 
=P. F. Lazarsfeld, “Panel Studies,” Public Opin, Quart, 4:122-28, 19% 
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marized admirably the advantages and problems of the panel 
technique as follows : °° 


1. If, ina special study, one is interested in more than a straight 
“yes” or “no” answer to a question, and needs elaborate com- 
ments by the respondents, it takes considerable time to get the co- 
operation of people and to improve their ability to express them- 
selves. If such an effort has been successful with one group, one 
can capitalize on it in each subsequent interview. For example, 
it is hardly possible to use a different group of people every week 
if detailed comments on a radio program or on issues of a maga- 
zine are needed. 

2. If one is interested in relating the opinions of people to 
their personal characteristics, the repeated interview gives us an 
opportunity to accumulate knowledge regarding the correspond- 
ents. It would be hardly possible in one interview to ask people 
about their leisure-time activities, give them a personality test, 
collect biographical information, and so on. But it is possible to 
secure part of this information each time we return to the panel. 

3. Under certain circumstances, the statistical reliability of 
repeated interviews with panel members is greater than that of 
answers gained from a series of distinct samples.*# A chan of 
a few percent might be reliable in the former, but unreliable in 
the latter case. The panel, therefore, can be smaller than the 
serial sample. It is also technically easier to get good representa- 
tion in one panel than to get the same effect with a great number 
of samples. ony 

4. In studying the attitudes of poopie toward an event which is 
itself extended in time, the panel is probably the only adequate 
tool of investigation. . . y 

5. In certain cases, the panel is a necessary substitute for the 
use of a control group, which latter is not as dependable a tool of 
research as is generally assumed. The use of control groups is 
taken from the physical sciences. The student of agriculture, for 
example, applies fertilizer to one part of a field and does not 
apply it to the other part of the field. The difference in yield 
between the two parts, if reliable, can be safely taken as the 
effect of the fertilizer. In social research, however, the situation 
is not so simple. If we compare, for example, employed and un- 
employed people as to their political attitude in order to see what 
the political effects of unemployment are, we cannot assume that 
the employed ple are an adequate control group for the 
unemployed. Fhe control would be adequate only if we could 
take a number of people, throw half of them out of their jobs, 
and then see how their political attitudes change, compared with 

23 jorie Fiske, op. cit, pp- 1-5. 

ph Sangh G “oh t k poe Gee. F. inei, “Panel Studies,” 

op. cit. 
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the employed group. But in any concrete research situation, the 
“control group” might have become unemployed for reasons 
which themselves affect the political attitudes. Most of the control 
groups available for social research are “self-selected” in this 
sense. By comparing, for instance, farmers who have radios with 
farmers who do not have radios in a rural non-electrified area, the 
latter group would prove to be, at the same time, the poorer 
farmers who cannot afford batteries. Therefore, any comparison 
between the two groups might pertain to economic differences 
much more than to the influence of radio ownership. 

Under certain conditions the panel is preferable to the control 
group method. .. It might have been possible, for instance, to 
ask a group of people how they felt about the Reorganization 
Bill before a speech by the President, and then repeat the ques- 
tion immediately after the speech had been delivered. Certainly 
this procedure would be more valid than comparing the opinions 
of people who have and who have not heard the President be- 
cause in the latter case the opinions might influence the willing- 
ness or unwillingness to listen more than the listening might in- 
fluence the opinions. 

6. The saving in cost, which the use of a panel would mean, is 
quite obvious and has already been mentioned. 

The problems involved in using the panel are closely related to 
the advantages. Once people have been induced to cooperate in 
a panel, their patience can be taxed much further. But how diffi- 
cult is it to get people's participation? How great is the mortal- 
ity even among those who have participated ? 

It is further probable that people who join a panel grow more 
articulate than those whom we approach by mere doorbell ring- 
ing for a short poll questionnaire. But not all of them will be 
articulate and if we use the opinions of those who talk much, 
don't we introduce a new fallacy? The problem of articulateness 
is not peculiar to the panel, but it does not enter into a poll 
where we ask only very simple questions. It becomes an essen 
problem in a panel, however, when it is set up for just this pur- 


A poll sample ps pate only that it be representative for the 
whole po ulation oi interest and large enough for the returns to 
be reliable. A ury introduces a new element. It has to be rep- 
resentative too, but it differs from the poll sample by the fact that 
the individual approaches the test knowing that he will be called 
upon for an opinion, „To give a concrete example: in a poll we 
ask people how they liked the speech they heard yesterday —in a 
jury we ask the members to pass am p on a speech they will 
presently be hearing. This “critical set” is a problem for the 
pes method as well as for the jury method. The panel also 

as to be statistically representative and large enough, just a$ 
poll and jury. The additional problem introduced by the panel 
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is that we do not know what becomes of this “critical set” as a 
result of the repeated interviews. Do people become more and 
more critical— drift, so to speak, into a professional attitude — 
more and more divorced from the life situation of the general 

ublic? Or must we reckon with an increasing “freezing” of the 
attitudes of the panel members? This freezing could conceivably 
occur with, for example, pla who have once explicitly stated 
their political attitude and are consequently less likely to change 
their minds than the general public. It might be, also, that 
people who have repeatedly tasted a manufacturer's product be- 
come less discriminating. 

So it appears that the use of the anel as a measurement of 
ublic opinion holds great promise ; but it also opens new areas 
a research which must be made before it can be used with 

safety. 


G. THE PARTICIPANT OBSERVER 


We have been concerned above with the more or less fors 
mal interview in which the recognized interviewer-inter- 
viewee relationship exists. Some of the most enlightening 
field work has been carried out by means of a far more inti- 
mate and informal set of relationships between the investi- 
gator and the investigated, namely, through the so-called 
“participant observer” technique. It consists of the observ- 
er’s becoming as nearly as may be a member of the group he 
investigates. That is, he settles in the community and par- 
ticipates in the everyday life of the group. The degree to 
which this is possible will vary with the characteristics of the 
investigator in relation to the culture of the community he 
has chosen to investigate. There are the instances of ex- 
plorers like Knud Rasmussen* and Peter Freuchen %® who 
settled down among the Eskimos and adopted their ways of 
life for periods of several decades. Freuchen, for example, 
married a native woman, reared children, and in all other 
ways shared the Eskimos’ life. It is difficult, of course, to 
distinguish between fact and fiction in some travelogues and 
other accounts of adventurous journalists and lecturers. But 
when men of scientific training and ideals produce such ac- 
counts, they may reveal aspects of a culture that more formal 
methods can not portray. > 

Less extreme forms of the participant observer technique 


25 Knud Rasmussen, Across Arctic America, (Putnam, 1927). 
26 Peter Freuchen, Arctic Adventure, (Farrar and Rinehart, 1935). 
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combined with more formal inquiry are such investigations 
as the Lynds’ Middletown," in which the investigators par- 
ticipate for a period of some weeks or months in a culture 
very much like their own. In any case, the principle is to 
observe by sharing as nearly as may be in the behavior ob- 
served. It is an attempt to reduce to a minimum the distor- 
tion of actual situations which may arise from the fact that 
the observer is an outside agent.?8 

A good critical account of the participant observer tech- 
nique in social research is Florence Kluckhohn’s summary 
of her experience in a Spanish-speaking village in New Mex- 
ico.” The student should study her paper in full. We can 
reproduce here only a few extracts. 


_ In the application of the technique it is essential to realize that 
it is not only the investigator himself but also the members of 
e group being studied who are to regard him as a participant. 
This means that he must achieve status within the community 
organization, must play his part in a constellation of roles. . - 
The basic factors which determine general roles are such objec- 
tive states or conditions as sex, age, position in a caste or class, 
and race. . . j 
_ The would-be participant must learn something of the manner 
in which these determinants of general roles are dealt with in the 
social organization of the particular community in question. Ifa 
neral knowledge of the social organization is not obtainable 
studies of similar communities, there must be a period of 
watchful waiting. . . 

The investigator is never able to shake off entirely his role of 
outsider, and I am in accord with those who maintain that it is 
not advisable for him to do so. Some exceedingly valuable in- 
formation comes to the outsider simply because he is one. It is 
the investigator's task, however, to fashion all his roles, including 
this one, into tools ; and the role of outsider can be an excellent 
device if one is in command of it. The critical point here woul 
seem to be whether the community categorizes one as a particular 
kind of outsider or just vaguely an outsider. It is once again a 
matter of orientation. . . z 

What is the value of being a participant-observer? First, it 


3R. S. and H. M. L Middletown, (Macmillan, 1929), p- 506- . 

38 The writings of C, H. Cooley on “sympathetic Maso” of experience 
in social research have never been surpassed for clarity of exposition of this 
subject. See especially Human Nature and the Social Order, (Scribnets, 
goa), Chap. 4. Also Sociological Theory and Social Research, (Holt, 198°) 

e anad 10, 

The Participant Observer Technique in Small Communities,” Ame": 

Jour. Sociol., 46:331-43. 1940. 
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would seem probable that the opportunities for observation are 
increased and that in some respects greater objectivity and ac- 
curacy insured. Second, it sad appear that a use of the tech- 
nique will provide a desirable balance between the more purely 
behavioristic type of investigation and the type which seeks some 
measure of insight into the “meanings” current in the community. 

In treating the technique comparatively its advantages and dis- 
advantages are considered in terms of what we shall call the three 
“R's” of evaluation, the range, relevance, and reliability of the 
data obtained. Does the technique increase the field of selec- 
tion? Does it provide data which are more relevant to the 
interests and activities of the group in question? ... And does 
it provide more reliable data than certain other techniques ? 

In four rather obvious respects the range of data would seem 
to be increased by use of the technique. First, it affords access to 
the data which come from observation in the current situations 
in which the community members are involved. The possibility 
of a large and consistent body of such information is peculiar to 
the technique in question, for, if the investigator seeks to gain’ 
information by the direct interview, the questionnaire, or the 
life-history, he is creating special situations, not participating tin 
those which arise within the group itself. Interviewing indirectly 
while helping a woman with her washing is very different from 
interviewing directly in the woman's parlor or one’s own tem- 
porary office. Even though the answers to the mein may be 
the same, which is doubtful, what one observe is far from the 
same. 

To some extent, of course, even the participant observer has an 
effect upon situations. He cannot wait, for example, for persons 
to recall voluntarily past events or to explain the relationship of 
Ato B. He asks leading questions at some points and, in doing 
so, alters the situation, but he is altering given situations in vary- 
ing degrees, not creating special situations. 

The range of available data is likely to be increased in a second 
respect. There are in all prone certain kinds of data which are 
guarded more closely than others. Direct questions regarding 
such information may be met with evasions if not outright mis- 
representations. Indirect questions may also fail. Simulation of 
behavior made possible by participation may, however, open the 
door to this guarded realm. Space does not permit illustration of 
this point, bat to indicate what is meant: It was through a 
gradual simulation of the Spanish-American woman's fear that 
I obtained most of my information about witchcraft and other 
beliefs which are jealously guarded by the Spanish-Americans. 

The ready access to gossip is the third respect in which par- 
ticipant observation serves to increase the range of information. 

' Gossip is an important source of information both for what is 
said and for what is not said. The comments made about be- 
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havior often give checks to and judgments upon behavior which 
can be obtained in no other way. Moreover, it is especially im- 
portant to listen to gossip if one wants to discover the patterns of 
“socially approved” behavior. A bit of behavior is observed. It 
may strike the outsider as unusual: If community members be- 
gin to comment upon it, and especially if they condemn it, it is 
relatively safe to assume that it is unusual behavior for the com- 
munity in question: If, however, there is no comment whatso- 
ever, it is also reasonably safe to assume that it has not struck 
community members as being deviant behavior. 

The fourth respect in which range is increased has already been 
suggested. It lies in the advantage derived from being in the 
position of the person who is being taught.2% It has long been 
an argument of field research workers that the prestige of being 
an outsider is of great importance, that people talk because they 
are flattered by attention from an outsider, This does apply toa 
certain type of person, but it may be added that all people, even 
the simplest, have “egos.” While some individuals are flattered 

aby attention and are happy for the opportunity to talk at length, 

e egos of research workers themselves have led to over-emphasis 
of this factor and to too little regard for the detrimental effects of 
a suspected attitude of superiority and condescension. . . 

There are, of course, some disadvantages in participant ob- 
servation. Participation in a community's choral roles may well 
restrict the range of observation and interview in one direction 
while increasing it in another. In a rigidly stratified community 
it may be disadvantageous inasmuch as participation in one seg- 
ment may preclude participation in another. Even in relatively 
homogeneous communities there is always some restriction. Par- 
ticipant observation may also force one to sacrifice a certain type 
of uniformity in data. It will not be possible to put a given 

uestion to a large number of ple —at least not in the same 
orm. Attitude tests, for example, or house-to-house inquiries on’ 
Various matters cannot be made. For those problems where su 
inquiries are essential participant observation will be a hindrance 
rather than a help. But in small communities such information 
can often be obtained indirectly and more accurately, for there 


will be less danger that surface uniformities are obscuring subtle 
differences.2% 


In short, different problems require different methods of 
approach, and different approaches yield different types of 


2 Cora Du Bois, “Some Ps ical jecti echniques in 
Ethnology,” Jour. Sociol Pocket, gaat k — - * ™ 

** George A. Lundberg and Margaret Lemie (“The Sociography of Some 
Community Relations,” Amer. Sociol. Rev., $:918-36, 1937, especially pP- 
326-27) point to certain limitations of the questionnaire method in procut 
ing friendship patterns which would seem to bear out this point. 


se 
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results. All of them are valuable for their respective pur- 
poses, and with respect to the type of result sought. Their 
combination yields the fullest description of the many-sided 
phenomenon of social behavior. 


H. THE LIFE HISTORY OR CASE HISTORY 


The modern life-history document is a lineal descendant 
of very ancient methods of sociological description and gen- 
eralization, namely, the parable, the allegory, the vision, the 
revelation, the myth, the story, and the novel.* The modern 
life history, however, is a detailed voluntary account of a 
person’s own view of the events of his life as revealed under 
the sympathetic questioning of a skilled inquirer. The ob- 
jective facts of such a story may be corroborated by the usual 
historical methods. But the life history aims to reveal not 
only these facts but their meanings to the informant. It is 
these meanings that are important in explaining his behav- 
ior. The illuminating character of life-history records, both 
as reflections of the general character of different cultures 
throughout the course of history and as the source of hypoth- 
eses regarding human personality and social behavior, is rec- 
ognized by everyone. As scientific material, much past and 
contemporary life-history material has been questioned on 
account of the informality and subjectivity of the observa- 
tions. Yet such studies frequently describe the sequence of 
adaptive processes throughout the life of an individual, and 
in this way have great value in practical and therapeutic ad- 
justments as well as in providing a type of sociological 
knowledge. These values make it quite out of the question 
to dispense with such records, whatever their scientific limi- 
tations may be. 

Accordingly, recent students of the problem have properly 
devoted themselves to the development of techniques for 
the recording and analysis of life histories which will more 
adequately meet the scientific standards of observation and 
generalization. There is no reason why this should not be 
successfully accomplished. The assumption that there is 
any fundamental incompatibility between the life-history 


30 Cf. L. L. Bernard, “The Development of Methods in Sociology,” The 
Monist, 38:292-320, 1928. 
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“method” and the techniques with which this book is mainly 
concerned is clearly unwarranted. “As a matter of fact,” 
says Kimball Young, “many of the basic problems of [case 
history] method are not unlike those of statistics. There is, 
of course, a recognition of a set of specific or general factors 
to be studied. The môre careful case-history studies also 
recognize the problem of sampling, since the aim is to avoid 
the temptation to generalize from one case. So, too, refer- 
ences are frequently made to typical cases, which is a way of 
referring to some central tendency. Likewise, attention is 
given to the divergent instance, which is but an informal 
way of stating facts about variability. Finally, the worker 
with these data may compare his findings with those from 
other samples, and in order to expose dynamic relations he 
may indicate in qualitative terms covariation or correlation 
between selected factors or situations.” *! Bernard has stressed 
the same point in the following passage : 


The real source of methodological error [in case study] is in 
Supposing that statistical generalization can proceed without def- 
inition of data or units or that generalization can proceed from 
case data without some sort of statistical method. So-called com- 
monsense inferences from cases amount to nothing less than 1n- 
formal statistical generalizations. Formal statistical generaliza- 
tions are more adequate, when practicable,32 


_ The problem of how to observe, classify, and generalize 
life-history material remains largely to be solved, but there 
1s no reason to consider the problem insoluble. Important 
steps have already been taken toward the establishment of 
criteria for the more adequate recording of such material 
and the development of methods of generalizing from the 
material secured. Space does not permit us even to sum- 
marize the techniques of securing intimate personal life his- 
tories through protracted interviews of the type employed 
by psychoanalysts, psychiatrists, sociologists and the abler 
social case workers. A general idea of the more systematic 
approach may, however, be secured from John Dollard’s 
standards for an adequate life-history technique : 

Ri pasih. tepes Arari Problems of Adjustment, (F. S. Crofts 

2L. L. + Op. cit, pp. 306-7, 314. 
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I. The subject must be viewed as a specimen in a cultural 
series. 

II. The organic motors of action ascribed must be socially 
relevant. 

Ill. The peculiar role of the family group in transmitting the 
culture must be recognized. a 

IV. The specific method of elaboration of organic materials 
into social behavior must be shown. 

V. The continuous related character of experience from child- 
hood through adulthood must be stressed. 

VI. The “social situation” must be carefully and continuously 
specified as a factor. 

VII. The life-history material must itself be organized and con- 
ceptualized.3 


The full implications and applications of these standards 
in actual cases have been discussed and illustrated in detail 
by Dollard. As in all research, the best grasp of a method is 
to be secured from a study of its actual use in conspicuously 
successful research. (See references at the end of this chap- 
ter.) Space permits us here to touch only on the aspects of 
the life-history method which have special relevance to in- 
terviewing.*+ 


What has been said in a previous section about the impor- 
tance of a sympathetic relationship between the interviewer 
and his informant, is, of course, of special importance in the 
life-history interview. The images which both participants 
in the interview have of each other, and the different roles 
which each assume will fundamentally condition the subtle 
interaction of personalities which characterizes especially this 
type of interview. It is this interaction which the inter- 
viewer must be able to observe and report objectively, al- 
though he is himself an integral part of the situation. Kim- 
ball Young has summarized the matter as follows : ** 


In identifying himself with the informant the interviewer must 
be able to sense or “feel” the former's aims, traits, attitudes, 
values, and other inner states. Yet in this understanding of the 


83 John Dollard, Criteria for the Life History, (Yale Univ. Press, 1935), p. 8. 
For some additional suggestions, see K. ponai op. cit, pp. 250-58. 

34 For an outline of the different npe of life-history materials, see Chap. 
VIII, pp. 220-21. The analysis of life-history documents is, of course, sub- 
ject to all the principles of criticism treated in our summary of the historical 
method (Chap. IV). 

3 K, Young, op. cit.. p. 257+ 
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role of the other, the interviewer must not overdo his identifica: 
tion and thus lose his own integrity and his critical capacities. 
The successful interviewer must have the ability to play a dual 
role—his own as manipulator of the situation’ and seeker of in- 
formation, and alsosinsimagination that of the subject or in- 
formant. Only in th y will he be able to follow the latter's 
answers, comments a nfessions sympathetically. In other 
words, he must learn to play another's role sympathetically in 
imagination, but he must not completely absorb such a role in his 
overt or verbal reactions. . . 

The competent interviewer must possess keen erceptive facul- 
ties and an accurate memory. Not only should he hear correctly 
what is said to him, but he will be alert to the overtones of the 
informant’s verbal and overt reactions, noting changes in voice, 

vindications of feeling-emotional states from facial or other ges- 
tures, and ov yeaa possible clues to inner states which may 
subsequently ex And, while he must know how to 
direct the session with skillful and revealing inquiries, he must 
also possess the capacity for sympathetic listening. Often in in- 
terviews, especially with persons under the emotional stress of 
mental conflicts, once the informant has begun to pour forth his 
troubles, the interviewer need but listen attentive y in order to 
in significant information about the fundamental inner life of 

s subject.35 
In summary, we may say that the life history in some form 
or other remains the great common-sense source of informa- 
tion about social behavior. Nearly all of the social knowl- 
edge by which the average person makes his adjustments, as 
well as the overwhelming proportion of the knowledge on 
the basis of which social agencies carry on their work, con- 
sists of informally generalized case histories. The principles 
thus derived, with all their shortcomings, owing to inaccu- 
rate observation, inadequate sampling, and faulty inference, 
are unquestionably the guiding tenets by which most men 
live and much social administration is carried on. Scientifi- 
cally derived generalizations have not as yet supplanted to 
any great degree the accepted conclusions from novels and 
voluminous but unsystematically generalized case histories. 
Representing, as these histories do, a kind of record of ex- 
perience, it 1s reasonable that in the absence of more reliable 
information and principles, the well-known case, although 
it may be chiefly a Stereotype, should be highly valued. At 


5 On the art of listening, see N. Diehl and R. S. Wilson, “Can Listening 
Become a Case Work Art?” The Family, 1499-105, 1993. 
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the same time, the fallacy of using a mere illustration as 
proof is a very common and mischievous one. 

In the meantime, important work is going forward in the 
production of more systematic, reliable, and relevant case 
records. We have noted the great va of life-history mate- 
rial in suggesting hypotheses for moré rigorous scientific tests, 
in throwing light on past situations giving rise to present 
conduct, and in a great variety of practical adjustment situa- 
tions. We have also noted the tendency of case studies to 
neglect the known principles of valid sampling, generaliza- 
tion from inadequate data, and the lack of standardized and 
objectified procedures permitting corroboration of both data 
and generalizations. We also have to consider the possibili- © 
ties of errors in perception and reporting, failures of mem- 
ory, unconscious omissions or repression of the unpleasant, 
distortion of recall, the tendency to dramatize and describe 
a desired imaginative role, and the temptation to project 
upon his subject the interviewer's own preconceptions, and 
to force the data into the theory with which we approach 
our task. 

Many of these dangers are common to all scientific work, 
and cannot be considered peculiar to the life-history ap- 
proach, although here some of them are undoubtedly espe- 
cially difficult to avoid. Science must proceed, however, on 
the hypothesis that none of these difficulties are insuperable. 
The improvements already noticeable in the life-history 
techniques are gratifying and promising. As we proceed 
with this work the apparent differences between the life-his- 
tory technique and other methods will tend to disappear. 
As Robert E. Park said many years ago regarding generali- 
zation from case histories: “The procedure here is just the 
same as in any of the natural and physical sciences. We ex- 
plain them by putting them under some general category, 
classifying them, in short, and then discovering where we 
can, the reason for the deviation.” 3° i 

There is no doubt that the above is a correct characteriza- 
tion of what is actually done in the case of life histories, as 
well as in the case of records of other behavior segments how- 


3R. E. Park, “Experience and Race Relations,” Jour. Applied Sociol., 9: 
22, 23, 1924. 
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ever secured and of whatever degree of complexity. The 
frequently assumed fundamental difference between case his- 
tory and other sociological methods is, therefore, undoubt- 
edly exaggerated. The different methods sometimes achieve 
the same result.®? e main difference lies in the informal- 
ity, and therefore in the verifiability, of the steps by which 
generalization from life histories is carried out. Since a clear 
account of the processes by which we reach our conclusions is 
a principal requirement of scientific method, this retnains 
the chief problem in the use of life histories as scientific 
material. 
The logic of generalization from case studies has been 
` trenchantly considered by Stouffer and Lazarsfeld.** They 
find that “rigorous treatment of case history materials im- 
plies dependence on statistics and tends to be almost futile 
if a multiplicity of variables is involved.” To reduce the 
numbers of variables, the device of typological classification 
is suggested. Such classification in its most elaborate math- 
ematical form takes the form of multiple factor analysis as 
developed by Thurstone, Hotelling, Kelly, and others. The 
discovery of a high correlation between two or more vari- 
ables makes it practically possible to neglect certain combi- 
nations of attributes or to reduce them to clusters which may 
then be treated as single variables. Thus Chapin in the 
Social Status Scale, 1933, found that a large variety of attri- 
butes having to do with socio-economic status could be re- 
duced to seventeen items. Burgess and Cottrell applied, with 
suggestive results, such methods to their elaborate case rec- 
ords of different degrees of marital adjustment. The more 
refined methods of factor analysis are too involved to permit 
exposition here, but their possible contribution to the solu- 
tion of the most difficult problem in the scientific utilization 
of case histories should be noted. The tendency in case re- 
a ‘ P wr) i 
Methods in Attitude Research, (Ph D thats, Unierty of Chicago. 199 
For a brief summary of this study, see the Publ. Amer. Sociol. Soc, 25:154 
56, 1930. See also Ruth S. Cavan, P. M. Hauser and $. A. Stouffer, “Note 


on Statistical Treatment of Life History Material,” Soc. Forces, 9:200-03 
1990. 

88S. A. Stouffer and P, F. Lazarsfeld, Research Memorandum on the Fam- 

2 in — abies: (Social Science Research Council, Bull. No. 29. 1937) 
ppendix A. 
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cording is to reduce the data to the form of an elaborate 
schedule or questionnaire.*® The data thus lend themselves 
to statistical treatment. The argument that such procedure 
destroys the important “configurational” aspects of the data 
will probably be met in the end by typological methods in- 
volving the fundamental logic of faetor analysis. 

Finally, there should be mentioned the practical suggestions 
of Lazarsfeld and Robinson *® for the more precise and quan- 
titative classification of case studies. The proposed procedure 
seeks to formalize and render more precise the process by 
which discursive, literary case stories are now informally 
classified and evaluated. Consider, for example, the selec- 
tion of an employee on the basis of a great deal of informa- i 
tion about him. “One observes that the applicant speaks 
positively and seems to know his business but also that he 
avoids making decisions, and one estimates his desirability 
in the light of these conflicting indications. The selection of 
another applicant, however, may involve very different indi- 
cators ; for example, the way in which a man expresses him- 
self in a letter and the recommendations he encloses.” ** 

The method suggested by the above authors is as follows : 


A. Define the continuum in terms of which the case studies are 
to be classified. This requires an explicit statement as to the gen- 
eral quality or characteristic according to which the individuals 
will be classified. i VT? 

B. Decide which indicators (i.e., “bits of information”) in each 
case study are items on this continuum, i.e., are relevant to the 
classification. If the continuum is that of dominance-submission, 
any specific detail helpful in deciding whether an individual is 
dominant or submissive is an indicator on the continuum for that 
particular individual. Deciding which indicators are relevant to 
the classification thus consists merely in specifying or concretizing 
the definition of the continuum in terms of specific items of in- 
formation. con 

C. Give a numerical value, with sign, to each indicator, ac- 
cording to its position on the continuum. To continue the pre- 
ceding illustration, if a particular detail indicates that the in- 


3E, W. Burgess and L. S. Cottrell, Jr., Predicting Success or Failure in 


Marriage, (Prentice-Hall, 1939), Appendices B and C. See also Chap. 16 of 
this a iE Sona A discunion of the problem here under con- 


sideration. e p 
“OP. F. Lazarsfeld and W. S. Robinson, “The Quantification of Case 


Studies,” Jour. Applied Psychol., 24:817-25, 1940. 
4 Ibid., p. 819. 
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dividual is submissive, it might be designated by — 1, to show 
that so far as this particular detail is concerned the individual is 
on the negative side of the dominance-submission continuum. 
If the specific detail indicates that the individual is dominant, 
this fact might be indicated by a aN 1. A detail relevant to the 
classification, but indicative that the individual is neither very 
dominant nor very submissive, might be given a value of zero, 
A more detailed rating scheme might also be used if desired. 

D. Combine the “scores” on the different indicators for each 
case study in order to determine a final index of the positions of 
the case on the continuum. The most efficient eonim o 
the indicator scores for a single case is their arithmetic mean. 1I 
different numbers of indicators are found in different case studies, 
these means will of course be based on different numbers of ob- 
servations. Thus if in a particular case study three positive, one 

‘zero, and four negative indicators are recorded, the final index 


will be 3 ©) tes — \% = — .125. If for another 


case study there are four positive, two zero, and six negative in- 
dicators, the final index will be EEO +C) — — 2/12 = 
12 


- iad Computing indexes in this way allows the utilization of 
all relevant information found in the case study ; none need be 
thrown away. The total range of the scores will, of course, lie 
between — 1 and -+ 1. 


In combining the scores it is neither necessary nor desirable 
that the number of indicators be the same for the different cases. 
It may even be that none of the indicators used in classifying 
one case will be involved in classifying another." 


In conclusion it should be said that the difference between 
securing an expression of attitude through a life-history 
document and through the formal opinion questionnaire is, 
of course, purely a difference in the type of stimuli situation 
through which we elicit a verbal response. In the case of the 
life-history document it is assumed that these stimuli are more 
indirect, covert, subtle, and slantwise, and thus more disposed 
to secure an unbiased self-revelation, as compared with the 
more direct and pointed questionnaire of opinion. This isa 
doubtful contention for two reasons: (1) The questionnaire 
method of eliciting response is capable of the same subtleties, 
covertness and indirection as are the stimuli which elicit a 
life-history document. (2) It is certainly fallacious to assume 


#2 Ibid, pp. 818-19. 
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that even such documents as private letters and diaries are 
free from the pretenses, fantasies, and rationalizations that 
undoubtedly invalidate to a greater or lesser degree the. face 
value of answers to a questionnaire. Says Kimball Young: 
“In truth, the digging up of past experiences by association 
is open to considerable qualification. In turn, personal 
documents, such as letters, memoirs, diaries, and confes- 
sional autobiographies are open to the same criticism that 
all forms of self-rating, self-revelation, and self-analysis are. 
The real motivations, especially, are not easy to get at from 
self-analysis, even though the main features of the mental 
mechanisms may be thus uncovered. If we wish to push the 
matter fully into the realm of motives, we may have to go 
beyond the ordinary autobiographical procedure. Here the 
interview, the method of mental analysis used by the psy- 
chiatrist, and the objective check-up from other sources of 
information come into play.” * (Italics ours.) That the 
conditions which bias, warp, and invalidate the face value 
of the responses in the letter, diary, or life history probably 
will be different from those which achieve a similar effect 
in the case of the questionnaire, is undoubtedly true. But 
the assumed superiority of life-history documents, as such, 
over questionnaires, as such, is an erroneous notion which 
has probably arisen from the use of extremely crude forms of 
questionnaires with correspondingly crude results. The 
point is that the results of neither form of response can be 
taken at their face value. Both must be considered as re- 
sponses to a total situation, and their face value as indications 
of subjective attitudes, or as indices of action in a real life 
situation, can be approached only by detailed analysis of the 
total situation under which the verbal responses are secured. 
No one questions the value of life-history documents, how- 
ever, and they should be used for what they are worth. Per- 
haps their chief drawback is the difficulty of securing them 
in sufficient numbers and for a representative sample of a 
population. The greatest value of life-history documents 
probably lies in their capacity to suggest hypotheses for 
more intensive investigation. 


43 Kimball Young, “The Measurement of Personal and Social Traits,” Publ. 
Amer. Sociol. Soc., 21:103, 1927- 
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I. THE SOCIAL SURVEY 


The social survey is not a special method of research dis- 
tinct from the methods that have been discussed in preceding 
chapters. The survey is rather a type of organization of 
field work. That is, a social survey might employ any or all 
of the methods we have considered. There are no additional 
or distinctively “survey” methods. The term, “social sur- 
vey” is conventionally used to designate (a) a more or less 
comprehensive inquiry into numerous aspects of a situation, 
(b) as it exists at a given time in a given community, (c) with 
rather definite educational, propagandist, and ameliorative 
purposes.*# As such it may utilize any or all known meth- 
ods of inquiry. 

Historically considered, the practice of systematic inquiry 
regarding human resources and social conditions goes back 
to very early times. Thus, Herodotus records a survey of 
the population and wealth of Egypt about 3050 B.C. prepara- 
tory to the building of the pyramids. The Bible records 
censuses of population and taxable wealth. Such surveys 
were repeated by various Roman emperors and are illus- 
trated in early England by William the Conqueror’s prepara: 
tion of the Domesday Book. Since the sixteenth century and 
the rise of national states a very large number of censuses 
and other inquiries on a large scale have been made at regt 
lar and irregular intervals. The social survey in its more 
modern and restricted sense is usually thought of as begin 
ning with the work of Le Play in his Les Ouvriers Européens 
(1855) or with Charles Booth’s Life and Labour of the People 
of London (1891-1897). After that time, the latter type ° 
inquiry became extremely popular, especially in the Unite 
States® The ready availability of funds for such purpos’s 

“4 “Soci: " s, ice b 
BEANE ii piaia totals E a 
atam At Survey, p. 5. M. C. Elmer, Social Research, (Prentice-Hall, 1939) 


45 For an outline of the development of such inquiries since 1500 S€ G 
A. Lundberg, “Statistics in Modern Social ‘Thought.’ in Contemporary Social 
Ta edited by H. E. Barnes, H. and F. Becker, (Appleton-Century, 1949) 
ap. 6. 
W'A Bibliography of Social Surveys, by A. Eaton and S. M. Harrison Rus 
sell Sage Foundation, 1930), contains sore then 2700 references to SU 
completed up to January, 1928. 
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from the great research foundations doubtless contributed 
to this popularity and a vast number of good, bad and indif- 
ferent surveys resulted. The'survey movement in its tradi- 
tional form is perhaps already past the peak of its popularity 
and may be destined further to decline," at least as far as 
surveys under private auspices are concerned. Among the 
reasons for this trend are (1) the rising standards of sociologi- 
cal inquiries, (2) the shrinkage of private research funds, (3) 
a more discriminating scrutiny of “survey” projects by the 
Foundations, and finally (4) the great extension of govern- 
mental records and inquiries. Some of the data usually se- 
cured with great trouble and expense through surveys will 
probably be provided (a) by a more comprehensive registra- 
tion or regular collection of data by governmental agencies ** 
and (b) by scientific inquiries into specific problems proceed- 
ing on definite hypotheses, and employing more refined 
methods of the types reviewed in previous chapters. 

Since the form of inquiry known as the social survey is 
nothing but a coordinated series of investigations or re- 
searches into related problems, all the methods and princi- 
ples of observing and recording data previously considered 
in this book may properly be applied to the survey. The 
practice of classifying the survey as a distinct method of re- 
search, or even placing it on a coordinate basis with such 
methods as the historical method, the case method, and the 
statistical method is clearly erroneous, as a survey may, and 
usually does, utilize all of these methods. It could, in fact, 
have no existence apart from these methods. The term “sur- 
vey” merely indicates a certain degree of comprehensiveness 
and coordination with respect to the content or subject mat- 
ter of an inquiry. A social survey, then, is a series of in- 
vestigations or research projects undertaken with the pur- 
pose of emphasizing the relationship of the findings. It has 
no methods of its own, nor any techniques of gathering and 

47 Cf, Helen R. Jeter, “Research and Statistics in Social Work,” Social 


Work Year Book, Russell Sage Foundation, 1941. f f 
48 Consider, for example, the inclusion o0! questions regarding housin; 


and unemployment in the Federal Census of 1940; the large number o! 


the Work Projects Administration ; and such Ee hh surveys of 
the whole nation as are represented by Recent Social Trends, (McGraw-Hill, 


1933). 
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evaluating data other than those outlined in Chapter IV and 
discussed in subsequent chapters. Only in the emphasis on 
coordination of individual i tigations, and therefore in 
the organization of its field work, does it call for separate 
consideration. 

1. General procedure in social surveys, — Since the only 
factor which distinguishes the social survey as a form of in- 
quiry from other forms of investigation is its inclusiveness 
and the coordination of a number of individual research 
projects, the organization of the field work of the survey is 
the only aspect which concerns us in the present connection. 
Chapin has summarized the technique of organization for a 
survey as follows : 


The usual procedure is to get together a central committee 
representative of all the helpful and cooperative interests in the 
l community. An outside expert in survey work is then 
employed to plan and direct the investigation. This director 
usually finds it helpful to secure the cooperation of several state 
and national welfare organizations in carrying out the undertak- 
peek È Tak size-up o ae local situation is then made ee 
` is prelimi -taki sua ri 
to the Saiak boaiitions eane aa apie ar malad- 
justment and points the way to further intensive study in special 
fields.49 

The desirability of securing for field work the backing of 
a representative group of community agencies was discusse‘ 
in an earlier section. Whether or not financial backing 15 
sought, the approval of these agencies facilitates the field 
work and lays the foundation for community organization 
to carry out programs of social improvement suggested by 
the findings. 

The need of skilled direction in social surveys cannot be 
too strongly emphasized. Too many reports show clearly 
the biased hand of the propagandist. Too frequently sur- 
veys are undertaken, not to reveal all the facts, but to accu- 
mulate facts to support the program of the individual or group 
sponsoring or conducting the inquiry. Sometimes this bias 
is not even intentional but merely the result of lack of traim- 
ing in scientific method on the part of the director. Every- 
thing which was said in Chapter II regarding bias, and the 


$F. S. Chapin, Field Work and Social Research, (Century, 1920), P- 1° 
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need for intellectual honesty in observing first-hand data, ap- 
plies with special force to fhe direction and conduct of 
social surveys. 

After the scope of the survey has been determined in a 
general way, a quick size-up of the whole field by an expert 
is necessary as a basis for organizing the detailed field work. 
The general scope may vary from a city block in all its as- 
pects to an entire region, with national or international 
boundaries. As the geographic scope increases, it is usually 
necessary to limit the scope of the content or subject-matter 
of the survey. Thus the most recent survey of Pittsburgh is 
devoted chiefly to “an examination of agencies and institu- 
tions that provide social and health services.” °° The scope of 
a survey both in content and geographic inclusiveness is de- 
termined in the last analysis by its purpose and the resources 
available for carrying it on. To determine how the available 
resources can best be utilized to achieve the ends sought is 
the purpose of the preliminary exploration by the director of 
the survey. On the basis of his preliminary exploration, he 
determines what sources to canvass and what situations to 
investigate. Only after this preliminary inquiry is it possible 
to organize the field work efficiently and economically. 

2. Common defects of present social surveys. — The princi- 
pal defect of social surveys up to the present time has been 
the lack of standardized procedure and the resulting incom- 
parability of the data accumulated. Facts are of significance 
only in relation to other facts. Unless the data collected 
with reference to one community can be evaluated accord- 
ing to recognized standards, or compared to similar data in 
other communities, they are of doubtful scientific value and 
are unsafe as guides for social action. Too many surveys are 
nothing but collections of data ona variety of topics without 
any attempt to evaluate the facts with n to recog- 
nized standards or to compare them with similar facts in 
other communities. If evaluation is attempted, it is too 
frequently on the basis of the surveyor’s subjective judgment 
of what are reasonable standards, or to further the ends for 
which the “survey” was undertaken. f 

It is in connection with the need for comparison and 


50 P. Klein, A Social Study of Pittsburgh, (Columbia Univ. Press, 1938). 
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evaluation of the data collected that the standards formu- 
lated by state or national organizations in various fields be- 
come valuable guides. Some Of these standards, as well as 
the standardized measuring-sticks developed in various fields 
for scoring institutions, have been discussed at length in 
Chapter IX. By means of such devices social surveying may 
become standardized, and the results have all the significance 
of accurate social measurement with reference to definite 
norms. With the development of a complete set of meas- 
uring instruments of the type considered in Chapter IX, social 
surveying would become mainly the technique of applying 
these measuring-sticks to the institutions and activities of a 
community and reporting the results in standard and com- 
parable terms. This practice would vastly increase the value 
of social surveys. First, it would make it unnecessary for 
each surveyor to invent his own methods of measurement. 
We have considered in preceding chapters the difficulties in- 
volved in constructing measuring scales, and the need for 
thorough €xperimentation and revision before a reliable 
measure can be attained. The use by surveyors of their own 
untested devices is almost certain to result in conclusions of 
very dubious value. Second, the utilization of standard 
methods at once makes the findings of significance by ena- 
bling comparisons with recognized norms, and with other 
communities. By the development and use of standardized 
methods the social survey may become a scientific enterprise 
of significance to social research instead of a “fad and an in- 
dustry,” 51 


J. CONCLUSION 


_ The development of social science will demand the gather- 
ing of data in the field to an increasing degree. The princi- 
ples of sociology, economics, and political science cannot be 
deduced from armchair speculations nor from the ponderous 
philosophical tomes on human nature. The objective obser- 
vation and recording of actual group behavior as it occurs 
is a constant necessity. 

The collection of first-hand data on a large scale involves 


#1 F, H. Giddings, The Seienti; jety, (Univ. of North 
Carolina Press, 1924), p. 185. i oe nanan Suen Some 
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many problems in the selection, training, organization, and 
supervision of field workers. It involves also the develop- 
ment of a reliable and efficient technique of stimulating, ob- 
serving, recording, and evaluating human responses, whether 
carried on in the more or less formal interview or otherwise. 
This technique is as yet somewhat undeveloped, but is be- 
coming the object of such study and analysis as promises 
much for its improvement. Finally, the social survey has 
made its appearance in recognition of the need for factual 
social data as a basis for social adjustments. Most of the 
social surveys up to the present time, however, have suffered 
from the lack of standardized methods of social measurement, 
and a failure on the part of social surveyors to apply the bet- 
ter measuring devices which do exist. Being itself merely a 
coordinated series of investigations employing a variety of re- 
search techniques, the results of a social survey cannot, of 
course, have any greater reliability than the individual tech- 
niques employed are capable of securing. In short, the spo- 
radic nature, the lack of standardized technique, and the lack 
of skilled direction of social surveys have made the data col- 
lected of doubtful scientific value. 

Many of the questions which it is hoped can be answered 
by a social survey cannot in fact be answered by this type of 
inquiry. Some of the necessary data can be secured only 
through a more accurate, general, and standardized recording 
of certain social events as they occur. If there existed reliable 
and comparable records of all relevant social events and 
processes, such as now exist, for example, for births and 
deaths, social surveys for almost any social grouping could 
be conducted pretty much as an accountant checks the books 
of a corporation and reports on its status. In short, many 
social surveys are merely attempts to make up for inadequate 
community bookkeeping. Inadequate and unstandardized 
recording of social data is a major handicap both to practical 
administration and to the development of the social sciences. 
We turn, therefore, to this problem in the next chapter. 
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K. SUGGESTIONS EOR FURTHER STUDY 


1. For agood brief summary of the history of modern s0: 
cial surveys, see Pauline Young, Scientific Social Surveys and 
Research (Prentice-Hall, 1939), Chaps. 1 and 2. See also her 
bibliography on this subject, pp. 535-41- 

2. For a detailed consideration of interviewing, with spe 
cial reference to social work, see Pauline Young, Interview- 
ing in Social Work, (McGraw-Hill, 1935). A more general 
treatise is W. Van Dyke Bingham and B. V. Moore, How to 
Interview, (Harper, 1931), especially Chap. 6 (“The Interview 
in Marketing Studies and Commercial Surveys,”) and Chap. 7 
(“Interviews in Social Case Work”). Extensive bibliog- 
raphies are to be found in this volume. Especially recom- 
mended as supplementary reading to the present chapter is 
The Technique of Marketing Research, (McGraw-Hill, 
1937), Chaps. 3 and 4, by P. F. Lazarsfeld. 

On the reliability and consistency of information secured 
through interviews as checked against other records, see Ger- 
trude Bancroft, “Consistency of Information from Records 
and Interviews,” Jour. Amer. Statis. Asso., 35:377-81 1940 

g. On the effects of bias on the part of the interviewer, S¢¢ 
S. M. Harvey, “A Preliminary Investigation of the Inter 
view,” Brit. Jour. Psychol., 28:263-87, 1938. Also S. A 
Rice, “Contagious Bias in the Interview,” Amer. Jour. Sociol., 
85:420-23, 1929. 

4. For an excellent appraisal of the life-history technique, 
see H. Blumer, Critiques of Research in the Social Sciences: 
I. An appraisal of Thomas’ and Znaniecki’s The Polish Peas- 
ant in Europe and America, (Social Science Research Coun- 
cil, 1939). This publication includes also the report of a 
panel discussion of the subject by a number of social scien- 
tists from different fields. See especially the admirable sum 
mary by Read Bain, pp. 191-202. For an especially compe 
tent series of case studies and a careful consideration of the 
scientific problems involved, see Mirra Komarovsky, 
Unemployed Man and His Family, (Institute of Social Re 
search, New York, 1940). See especially the Appendix. 

5. “An Outline for Writing a Case History,” will be found 
in K. Young, Personality, and Problems of Adjustment, 
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(Crofts, 1940), Appendix. For a more detailed treatment, see 
John Dollard, Criteria for the Life History, (Yale University 
Press, 1935). For recent examples of the technique in cur- 
rent surveys, see John Dollard, Caste and Class in a Southern 
Town, (Yale University Press, 1938) ; Allison Davis and John 
Dollard, Children of Bondage, (American Council on Edu- 
cation, Washington, D. C., 1940). 

6. On the “participant observer” technique see Florence 
R. Kluckhohn, “The Participant-Observer Technique in 
Small Communities,” Amer. Four. Sociol., 46:331-43, 1940. 
For examples of the use of this method, see William F. Whyte, 
“Corner Boys: A Study of Clique Behavior,” Amer. Jour. 
Sociol., 46:647-64, 1941; Pauline Young, Pilgrims of Rus- 
sian-Town, (University of Chicago Press, 1932); Nels An- 
derson, The Hobo, The Sociology of the Homeless Man, 
(University of Chicago Press, 1923); C. M. Arensberg and 
S. T. Kimball, Family and Community in Ireland, (Harvard 
University Press, 1940). 

7. On case recording as practiced by social 1 rk agencies 
see Florence Hollis, Social Case Work in Practice, Six Case 
Studies, (Family Welfare Association of America, New York, 
1939). Also Margaret C. Bristol, Handbook on Social Case 
Recording, (University of Chicago Press, 1936). 

8. An exhaustive bibliography of social surveys completed 
up to January 1, 1928, will be found in A. Eaton and Shelby 
M. Harrison, A Bibliography of Social Surveys, (Russell Sage 
Foundation, 1930). The most adequate recent bibliography 
of various governmental surveys and research projects is the 
Index to Research Projects, Vols. I-III (Works Progress 
Administration, 1938-39) and Catalogue of Research and 
Statistical Publications, (Division of Social Research, Work 
Projects Administration, March, 1941). A valuable book 
suggesting the types of questions that surveys are frequently 
designed to answer is Joanna C. Colcord, Your Community 
(Russell Sage Foundation, Second Edition, 1941). 

g. For excellent examples of recent surveys see: The So- 
cial Survey of Merseyside, edited by D. Caradog Jones, (Uni- 
versity Press of Liverpool, Hodder and Stoughton, Ltd., Lon- 
don, 1934), 3 Vols; Calvin F. Schmid, Social Saga of Two 
Cities, An Ecological and Statistical Study of Social Trends 
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in Minneapolis and St. Paul, (Minneapolis Council of Social 
Agencies, 1937); Philip Klein, A Social Study of Pittsburgh 
Community Problems and Social Services of Allegheny 
County, (Columbia University Press, 1938). 

10, Twila E. Neely, A Study of Error in the Interview 
(N. Y., Columbia University Press, 1937) . Also recent studies 
in Sociometry, 5:109-134, May 1942. 

11. C. R. Rogers, “The Nondirectiive Method as a Tech- 
nique for Social Research”; Amer. Jour. Sociol., 50:2'79-283) 
Jan. 1945. 

12. Jessie Bernard, “Observation and Generalization in 
Cultural Anthropology”, Amer. Jour. Sociol, 50:284-291) 
Jan. 1945. 
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SOCIAL BOOKKEEPING 
at 


There are in the United States today 88 different agencies of the federal 
government collecting statistics about economic and social conditions. This 
amount appears to be a great deal to those who made out the 135 million 
returns on 4700 different forms in 1938, not a regular decennial census 
year. But a state cannot administer its affairs without knowledge of the 
conditions it deals with, any more than engineers cin build the Grand 
Coulee dam without expert knowledge about steel and concrete. The en- 
gineer learns about his materials in the laboratory, while the government 
obtains its knowledge from statistics. In both cases the knowledge needs 
to be precise, the result of measurement. What if the census costs 
forty million dollars, it is necessary and worth it to the state, as well as to 
the uncounted users of census data... On the basis of the journal pub- 
lications using statistics, the sciences are ordered as follows, psychology, eco- 
nomics, sociology, anthropology, political science, and history. That statis- 
tics is becoming a method of measurement in all these sciences is evidenced 
by the fact that during the past third of a century, the proportion of space 
used for formulae, tables and charts in these particular journals has in- 
creased 115 per cent. — William F. Ogburn1 


A. SOCIAL ORGANIZATION AND THE GATHERING OF SOCIAL DATA 


If it is conceded that the advancement of thessocial sciences 
is dependent upon a more comprehensive collection of ac- 
curate and comparable social data, then the problem of pro- 
viding for the systematic collection and filing of such data 
becomes of primary importance. The difficulties of con- 
trolled laboratory experimentation in the social sciences and 
the consequent inability to speed up artificially the particu- 
lar social process which we wish to observe, make it all the 
more necessary to record social data as they occur over long 
periods of time. There are many vital types of social processes 
which cannot be reliably analyzed and interpreted except 
over periods of time embracing many generations. This 
characteristic of many social data, as well as the fact that we 
must have to some degree the cooperation of people in gen- 
eral for the collection of such data, makes us dependent to a 
large extent upon public authority for any comprehensive 
survey of social conditions and processes. The problem of 
the systematic gathering of social data on a large scale be- - 
comes, therefore, a problem of social organization. Hence, 


1 “Statistical Trends,” Jour. Amer. Statis. Asso., 35:256, 258 (Part II), 1940. 
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the problem of keeping the organization free from partisan 
and political influence is the Bre as in the case of organ- 
izations for other purposes. This becomes especially im- 
portant in the case of an organization that is collecting data 
upon which social and political administration largely de- 
pend. 

The necessity of keeping public records of social data has 
for various practical reasons been recognized since ancient 
times. Rulers and governments have always found some re- 
cording of vital, economic, and other social statistics indis- 
pensable to the guidance of public affairs. While these rec- 
ords have usually been regarded as useful chiefly from the 

tandpoint of practical administration rather from the stand- 

point of data for science, these data have nevertheless become 
the basis for whatever generalizations underlie public pol- 
icy. Not only is this true of the records kept by govem- 
ments, but most institutions, public and private, have enor- 
mous archives of records. Taken together these constitute a 
collection of millions of facts which, if they were intelligible, 
would represent an important record of social experience 
from which scientific generalizations of great importance 
might be drawn. But unfortunately these records are largely 
useless on account of their incompleteness, inaccuracy, and 
incomparability. In short, because the principles of collect- 
ing data emphasized throughout this book have not been 
observed in the recording of the mass of data found in pres- 
ent-day government and institutional archives, these data are 
for the most part largely useless for scientific purposes. 


B. THE DEVELOPMENT OF NATIONAL AND INTERNATIONAL 
STATISTICS 


_ With the expansion of political units and the rise of na- 
tional and international economic contacts and political 
organizations, the need for uniformity in methods of record- 
keeping has received increasing attention. Some of the 
older and stabler countries of Europe have partially compa- 
rable records extending over several centuries, not to mention 
the highly systematized methods of recording certain CON- 
temporary vital data. Not only has the need for uniformity 
and standardization within particular countries found €x- 
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pression, but since 1851 there have been comprehensive 
efforts to bring about interfational cooperation in securing 
standardized methods of recording vital social data. Will- 
cox designates the London International Exhibition of 1851 
as the beginning of international statistics. At this meeting, 
Quetelet suggested that an International Statistical Congress 
should soon be held for the purpose of securing greater com- 
parability in statistical publications. This congress met in 
Brussels in 1853 under the presidency of Quetelet. It was 
followed by eight others during the following thirty-five 
years and by the establishment of a Permanent Commission 
to further international cooperative work in statistics. This 
series of meetings came to an end in 1878 and “somewhat th 
same international purpose was served thereafter by the In- 
ternational Congress of Hygiene and Demography which 
held its latest and perhaps its last session at Washington in 
1912. At the Jubilee meeting of the Royal Statistical Society 
a new organization was started, modeled somewhat on the 
earlier Permanent Commission and with a limited profes- 
sional membership. It met biennially, 1895-1913, and again 
in 1923.”2 In the meantime the International Statistical In- 
stitute at The Hague had carried on its work since 1885.° 
When the international meeting of 1923 took place, the need 
of the League of Nations for fuller and more trustworthy 
and comparable figures about the economic conditions of 
Europe lent especial significance to the systematic collection 
of certain data on an international scale. Recognizing the 
need for factual data as a basis for international as well as 
national government,‘ the League, in spite of its recent vicis- 
situdes, still (June, 1941) publishes the Monthly Bulletin of 
Statistics and the Statistical Year-Book of the League of Na- 
tions. These publications contain comprehensive statistics 
for most nations on demography, production, labor, employ- 
ment, cost of living, prices, finance and other subjects. 


2 Walter F. Wilcox, “The Relation of the United States to International 


Statistics,” Publ. Amer. Sociol. Soc., 18:18, 1923- Ges) 
8 Lucien March, “International ae, and the League of Nations, 
Publ. Amer. Statis. Asso., 17 (N. S.):635-38, 1920-21. n 
4 For a account of the didon leading vp to the establishment of the 
League's statistical service, see “Report of the International Statistical Com- 
mission Appointed by the Council of the League of Nations,” Publ. Amer. 
Statis. Asso, 17 (N. S.):632, 1920-21. 
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C. THE UNTIED STATES CENSUS BUREAU 


The United States was represented at most of the interna- 
tional gatherings mentioned above. The first decennial cen- 
suş was taken in 1790, but the organization and scope of the 
work was rather rudimentary until 1850. A permanent Bu- 
reau of the Census was not established until 1902. While 
individual states, cities, and private organizations are, as We 

shall see, engaged in a great deal of more or less systematic 
collection of various kinds of data, the Federal Census Bu- 
reau remains our most important single agency for gathering, 
standardizing, and publishing statistical information. The 
cee decennial census alone is an enormous undertaking 
vhich is constantly increasing its activities. We have already 
reviewed in a previous chapter (Chapter XI) the organization 
and conduct of a national census. The Census Bureau is 
also constantly engaged between the decennial censuses in 
the collection and analysis of vital statistics, agricultural, 
commercial, and other data. Among the more important of 
these activities may be mentioned: (a) special studies, such 
as the recent study on “Comparative Occupation Statistics, 
1870 to 1930"; (b) preparation and revision of intercensal 
and post-censal population estimates ; (c) a decennial census 
of religious bodies; and (d) the collection of data on mental 
patients, and statistics of criminal and judicial trends. This 
includes the establishment of a prison register to serve as â 
current inventory of the prison population, and a special 
sampling study of prisoners in the jails of the District of Co- 
lumbia, Baltimore, and Philadelphia County. (e) In the 
field of vital statistics, the Bureau carries on extensive activi- 
ties such as the following: (1) work toward a uniform vital 
statistics law for the various states and the use of standard- 
ized birth and death certificates; (2) revision of the Inter 
national List of Causes of Death ; (3) collection of hospital 
and institutional statistics; (4) collection of marriage an 
divorce statistics ; (5) collection of accident statistics ; and ( 
special field projects for the improvement in registration 0 
vital statistics. Extensive research is carried on in the Bureau 
for the improvement of census work and the development © 
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sampling techniques, upon which the further expansion of 
census work must perhaps be largely dependent. 

Most of the data and results of these extensive activities 
are made available through numerous periodic publications, 
annual reports, and special reports and releases.* ‘The most 
generally useful of these publications is perhaps the StatiSti- 
cal Abstract of the United States, published annually. This 
volume of about 1000 pages is perhaps the most valuable 


compilation of basically important social data to be found, 


anywhere in the United States. The Census Bureau is like- 
wise by far the most important organization in the United 
States, and perhaps in the world, for extending, improving, 
and standardizing the collection of the kind of social data 
upon which intelligent administration as well as compr 
hensive social research are increasingly dependent. The in- 
quiry into housing conditions in the census of 1940, and the 
availability of census tract data from this census for some 
sixty cities will greatly increase the possibilities of refined and 
significant analysis of census statistics. 

Extensive as are the activities of the Bureau of the Census, 
it is by no means (as we shall see below) the only agency, even 
of the federal government, engaged in the compilation of 
official statistics. For the year ending June go, 1938, there 
were received some ten million income tax and informa- 
tional returns and some seventeen million returns in con- 
nection with the old-age insurance program. Altogether 
about 9714 million administrative returns were made to the 
Federal government by individuals, institutions, and cor- 
porations. In addition about 8 million non-administrative 
returns were made during the same year, of which about 
12 million were in connection with the census of unemploy- 
ment. In a census year, the figure for non-administrative re- 
turns is, of course, greatly increased. The following account 
of the numbers of each type of return during a non-census 
year gives some idea of the subjects covered.™* 


5 For a partial list of these, see the Report of the Secretary of Commerce, 


1940, pp. 57-58. ; 
5* Report of the Central Statistical Board, (House Document No. 27, Gov- 


ernment Printing Office, 1939), PP- 3 7- 
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It is estimated that about 21,000,000 returns were filed for 
farms, making an average of approximately three returns per 
farm during the year. A total of 60,000,000 returns were received 
from industrial, commercial, and financial enterprises (not in- 
cluding farms but including all other types of business) or an 
average of about go per enterprise. On the average, each of these 
business returns contained answers to about 65 questions. ‘These 
averages conceal wide differences among different classes of busi- 
ness. As might be expected, the returns, in general, were most 
extensive for regulated enterprises and for large-scale unregulated 
enterprises. average number of returns filed per railroad 
was more than a thousand, and the average number of questions 
answered per return was about 150. For a group of large-scale 
enterprises, including manufacturers of automobiles, electrical 
apparatus, and steel, the average number of returns per respond- 
ent was about 250, while the average number of questions 
answered per return was 65. In contrast with these large con- 

rns, the average independent retail store filed about 10 returns 
with the Federal Government and answered about 50 questions 
per return.® 


If we add to this vast fund of information the reports col- 
lected by state and local governments, we gain some idea of 
the extent of government bookkeeping even in a country in 
which private enterprise is still supposed to be dominant. 
F is true, of course, that reports to the government have 
multiplied greatly during the past decade with the increased 
centralization of government and a rapid expansion of its 
functions. As Ogburn says: “Statisticians were called to 
Washington in hordes, and some who were not statisticians. 
The demand far exceeded the supply. From 1930 to 1938 
the number of statisticians in government service in Wash- 
ington is reported to have increased 700 per cent.” 7 


D. TRENDS TOWARD INCREASING SCOPE AND STANDARDIZATION 
OF SOCIAL STATISTICS 


In addition to the comprehensive activities of the Census 
Bureau and the administrative governmental agencies which 
have been reviewed above, recent years have also seen in 
creasing attention to the collection and standardization of 
social statistics by other branches of the government and na 
tional associations of private agencies. We summarize 


Ibid, p- 10. 
jour. Amer. Statis, Asso, 35:256, 1940. 
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low some of the more important of these developments dur- 
ing the decade 1930-40. 

1. Uniform reporting of social work statistics. The need 
for comparable data regarding the work of the vast number 
of public and private agencies engaged in social service, re- 
lief, and other social work was recognized in this country at 
least as early as 1874.8 A successful program toward that 
end, however, was not begun until 1924, by the Community 
Chests and Councils, Inc. The Russell Sage Foundation also 
promoted the cause by sponsoring a system of standardized 
voluntary reporting of such statistics, beginning in 1926. 
These undertakings led to the establishment in 1928 of a 
registration area for the collection of social work statistics. 
This work was at first under the auspices of a joint commit- 
tee of the University of Chicago and the Association of Com- 
munity Chests and Councils, but was soon transferred to the 
United States Children’s Bureau. The service was further 
transferred to the Social Security Board in 1936. Since the 
establishment of this system of registration, forty-four cities 
have been brought into the registration area. The reporting 
covers the entire field of social work and also hospitals and 
out-patient departments. Financial as well as service data 
are included. 

In addition to the data rendered available through the 
above system of registration, attention should be called to a 
number of handbooks published by various national agencies 
which define terms and describe statistical procedures. Such 
handbooks are now available for statistics of legal aid (1924), 
child guidance clinics (1930), public health nursing (1932); 
medical social service (1933), family welfare (1934), and hos- 
pitals (1935). A series of handbooks on similar and addi- 
tional subjects was published by the New York State Depart- 
ment of Welfare during 1935 and 1936. : 

The Social Work Year Book, 1941, summarizes the present 
situation as follows : 

Despite the improvement both in quantity and in quality of the 
statistical reporting by social agencies since 1924, 1t 1$ still impos- 
sible to mi for the United States a comprehensive statement 

8 See Emil Frankel, “Standardization of Social Statistics,” Soc. Forces, 5:2447 
45, 1926. 
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of the volume and cost of public and private social services such 
as that published for the British Social Services by the Committee 
on Political and Economic Planning. Even in New York State, 
where the reporting is more complete than in any other state, 
there are gaps in reports from the voluntary social agencies in 
certain fields. Only in the field of public relief is it possible to 
publish each month for the ted: States as a whole, and for 
each state and local community, the number of cases, households, 
or perone receiving assistance and the amount of assistance re- 
ceived. 

The present fairly complete state of public relief statistics is 
the result of cooperation among state and local agencies achieved 
under the leadership of five federal agencies (United States Chil- 
dren’s Bureau, Federal Emergency Relief Administration — 
liquidated June go, 1938, Work Projects Administration, Social 
Security Board, and Central Statistica Board) with the encourage- 

t of the Joint Committee on Relief Statistics of the American 
i ae Welfare Association and the American Statistical Associa- 
ion. 


2. The Children’s Bureau. In addition to its early spon- 
sorship of centralized reporting of social statistics as reviewed 
above, the Children’s Bureau has been prominent in the 
development of systems of reporting many additional types 
of information. Extensive studies of maternal and infant 
deaths will be found among the publications of the Bureau. 
Its statistics of work permits issued to children of school age 
since 1920 provide an index of the decline of child labor. 
The collection and publication of standardized juvenile 
court statistics since 1927 is another contribution of the Bu- 
reau. The organization of nation-wide reporting of maternal 
and child health, crippled children, and child welfare serv- 
ices are among the Bureau’s present projects. 

3 The Federal Emergency Relief Administration and 
the Work Projects Administration. Statistics on public un- 
employment relief from 1933 to 1935 were collected with 
practical completeness by the Federal Emergency Relief Ad- 
ministration. Monthly reports from counties were secure 
through state offices covering case loads, relief granted, ad- 
ministrative costs, and sources of funds. With the with- 
drawal of federal grants for general relief in 1935, the reports 
from state and local units were unfortunately impaired. The 


* Social Work Year Book, 1941, p. 486. 
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Works Progress Administration continued to publish monthly 
Statistics, making its own estimates for areas for which re- 
ports were inadequate or lacking. This reporting system for 
general relief was transferred in 1937 to the Social Security 
Board. The W.P.A. continued to compile the statistics of 
the Works Program. 

In addition to the above collection of statistics the Division 
of Social Research of this federal agency is constantly engaged 
in special studies, and has published a large number of ex- « 
cellent monographs, most of them based on extensive field 
studies. We have referred to some of these contributions in 
previous chapters. 

4. The Social Security Board. The Social Security Bul- 
letin, published monthly, is today the most important source 
on public relief statistics for the United States and for each 
state. The Bulletin not only continues the statistics of the 
W.P.A. mentioned above, but integrates them with other 
statistics of other public aid programs, and arrives at an esti- 
mated unduplicated count of households and persons receiv- 
ing relief. The statistical series of private agencies trans- 
ferred from the Children’s Bureau is also continued by the 
Board. More detailed relief data are collected from public 
agencies in cities of 400,000 or more population. The Social 
Security Bulletin also publishes a variety of data and analyses 
bearing on its special functions. A fede 

5. The Bureau of Labor Statistics. This organization is 
the principal source of reliable data on a wide variety of sub- 
jects concerning employment, conditions of work, wages, and 
related subjects. These data are published in the Monthly 
Labor Review, which also publishes periodically a cost of liv- 
ing index number constructed from extensive reports of re- 
tail prices of different commodities throughout the nation. 

6. The Central Statistical Board. — This Board was estab- 
lished by executive order in 1933, to coordinate, improve, 
and develop the vast statistical activities of the government. 
The Board has served in an advisory capacity in all the major 
developments in government statistics since that time. In 
1938 the Board began a comprehensive study of the various 
returns made by the public to the government with a view 
of avoiding duplications and otherwise improving statistical 
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services. The Board recommended, among other more de- 
tailed suggestions, that although the collection of various 
types of statistics should be left for the most part to the ad- 
ministrative or other agencies chiefly concerned with the sub- 
ject in question, this decentralization makes a strong statis- 
tical coordinating agency with adequate powers all the more 
desirable.1° The functions of the Board were taken over, in 
1940, by the Division of Statistical Standards of the Bureau 
of the Budget. 

There are, of course, a large number of governmental as 
well as private organizations engaged in the collection of so- 
cial statistics in their respective fields, which have not been 
mentioned above. A complete list would obviously include 
such agencies as the Office of Education, the United States 
Public Health Department, the Department of Agriculture, 
and the very large number of agencies dealing with strictly 
economic statistics. In the latter field, also, a very large 
number of private organizations such as the National Bureau 
of Economic Research and the National Industrial Confer- 
ence Board should be mentioned. Even a brief review of all 
of these agencies, together with the large number of state 
and municipal organizations devoted to the recording of $0- 
cial experience in the form of statistics, is obviously beyond 
the scope of this chapter Our purpose has been not to 
provide a directory of statistical sources but to call attention 
to the principal movements under way for promoting unl 
form, standardized accumulation of such data. 


E. OBSTACLES TO COMPREHENSIVE AND STANDARDIZED SOCIAL 
BOOKKEEPING 


The adequate accumulation of social data for the guidance 
of society is, like most social problems, a problem of sock 
8 tion. As such, the standardization of social statistics 
advocated in this chapter is retarded or prevented by many 
of the same obstacles which stand in the way of efficient 
1 See Report of the Central Statistical Board v City i 
An excellent exa; of what is bein tedy g t ae A ees along 
the lines here under will be found in L. Wirth and Margaret 
Furez (Editors), Local Community Fact Book, 1998, (Chicago Recreation 
ission, 1938). The studies by H. W. Green for the Cleveland Heal 
Council are further examples of a very important type of social research. 
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organization in any field : tradition, inertia, and cultural lag. 
It is difficult to convince the great masses of people of the 
usefulness and necessity ofthe impersonal and abstract sym- 
bolic devices on which we must rely to an increasing degree 
for our control over our more removed social environments. 
The great majority of people are still organized on the basis 
of the simple face-to-face relationship of primary groups. As 
a result they observe and retain the significant data of past 
experience for the guidance of their future action by the 
informal and subjective method of casual observation and 
memory. The external storage of past experience in sym- 
bolic form—statistics and written records of every kind— 
which adjustment in the larger society requires, seems to the 
great masses of men to be an abstract, unreal, and unreliable 
process. The frequent inadequacy and unreliability of pres- 
ent records lends weight to the popular suspicion of statistics 
in all its forms. This in turn is largely due to the incompe- 
tence and carelessness of some persons at present engaged in 
recording and interpreting statistics. Comprehensive organ- 
ization for the recording of social experience on a large scale 
such as the growing extent and complexity of our social con- 
tacts require is not an easy program to which to convert the 
general public. 

In the first place, the increasing size of the task demands 
an increasing financial expenditure. With the public atti- 
tude what it is, more adequate provision for recording and 
bookkeeping in the public service and in private social agen- 
cies is likely to be the first to suffer from programs of econ- 
omy. Even the Federal Census Bureau is severely handi- 
capped financially in its efforts to tabulate and render useful 
many of the data which it collects. This is a situation which 
must to a large extent wait for its remedy on a change in 
public attitude. In the meantime the great private Founda- 
tions are performing a most valuable service in developing 
the technique of adequate recording as well as in educating 
social agencies and the public generally to the need and value 
of social bookkeeping. 

In the second place, the divided jurisdiction of our system 
of government with its division of power between local, state, 
and national governments renders the adoption of uniform 
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methods of recording social data extremely difficult. The 
purposes for which a local community records certain data 
may not be as’ broad as the purposes for which state or na- 
tional governments may desire these data. The data of the 
local community may therefore be inadequate for these larger 
purposes. Or, the data collected in different communities 
may be for the same purpose, but may be collected in differ- 
ent units and forms so as to render them incomparable. 
Only a central agency with mandatory powers such as the 
national Census Bureau can entirely remedy this situation. 
In the meantime, as we have seen, much excellent work has 
been accomplished in this direction through the voluntary 
_ cooperation of the states, local units, and private agencies with 
central agencies. Standardization on a national and inter- 
national scale for many types of data is the goal to be sought. 


F. SOCIAL CHANGE AND STANDARDIZED STATISTICS 


Aside from the problems of social organization and au- 
thority which are involved in the standardization of social 
statistics, there are certain problems which arise from the 
changing social order itself. In our attempts to standardize 
units and methods of recording for long periods of time we 
are confronted with the fact that the changing social order 
renders certain units obsolete and demands the creation 0 
new ones. Thus the census classification of occupations 
shows the disappearance of certain occupations and the ap 
pearance of others, Certain occupations have practically dis- 
appeared owing to mechanical inventions, and others have 
been absorbed by the rise of some new occupation. Or, the 
functions which used to be performed by people of certain 
occupational designation may have greatly altered, although 
the occupational name remains the same. For any Or all of 
these reasons, a study of occupational classification of the 
present population and that of the middle of the last century 
might be very difficult and unreliable in its conclusions. The 
incomparability of statistics over a long period of time is a 
problem which will arise under the most carefully organiz 
system of recording. New purposes which data are to serve, 
as well as new types of data, will dictate changes in units 
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from time to time. Under such circumstances the most we 
can do is either (1) to retain both the old and the new units 
and methods of recording if comparability with previous fig- 
ures in the particular case ûnder consideration is regarded as 
sufficiently important to warrant such duplication ; or (2) to 
base the new classification, when possible, upon the old in 
such a way as to make the old and the new units subject to 
interpretation in terms of each other. 


G. CONCLUSION 


We have considered in this chapter some examples of the 
organizations which contemporary society has developed for 
the systematic recording of certain types of human behavior. 
These types of behavior have always been the object of infor- 
mal observation and comment. Rough generalizations from 
these observations in the form of folklore have also come down 
to us from very ancient times. That is, age distributions, sex 
ratios, morbidity, mortality, marriage»rates, and other char- 
acteristics of populations and uniformities of social behavior 
have always been a source of primary interest. It was per- 
haps also for this reason that these data were among the first 
to be formally and systematically recorded. In any case, it 
happens that on these subjects we have relatively reliable 
records, representing some millions of individual observa- 
tions, extending in some countries over three centuries. We 
are so accustomed to the systematic collection of these demo- 
graphic data that we take them more or less for granted. 

As a result of the possession of these records we are able to 
demonstrate with great reliability the orderlinéss and pre- 
dictability of certain demographic phenomena. This knowl- 
edge has in turn relieved us from the notions that these 
events are governed by personal, whimsical, or vindictive 
agencies of some sort. More especially has this type of knowl- 
edge enabled us to predict and make community provisions 
(e.g., through insurance) for the events in question. The 
accurate prediction of the future size of populations, age dis- 
tributions, marriage rates, €tc., is obviously of basic impor- 
tance in any social planning of such facilities as housing and 
educational and economic opportunities. Finally, it is in the 
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field of demography that we have made most progress toward 
the establishment of reliable generalizations of the type at 
which all sciénce aims. 

The superior development of positive, verifiable, socio- 
logical knowledge in the field of demography has erroneously 
led to the assumption that data of the type mentioned above 
are in some intrinsic sense more “objective” than it is pos- 
sible for certain other data of social behavior to become. 
As we attempted to show in the first chapter, however, ob- 
jectivity is not a quality or an attribute of data per se. Ob- 
jectivity refers rather to the character of the symbols with 
which we communicate about phenomena. When these 
symbols have verifiably similar meanings to all men, we call 
the phenomena to which the symbols refer objective. It is 
not necessary to deny that such objective designations may 
be easier to invent for some phenomena than for others. 
To approach this question profitably, however, we would 
need to know the amount of time and effort devoted to such 
attempts for different types of phenomena throughout the 
centuries. It is sufficient for our purpose to point out that the 
advancement of science has consisted of gradually rendering 
“objective” increasingly large areas of traditionally “sub- 
jective” phenomena. We have merely assumed in this book 
that this process will continue also in the future and espe- 
cially in the social sciences. 

We have reviewed in preceding chapters some recent meth- 

_ ods by which such “subjective” phenomena as attitudes, opin- 
lons, morale, and Status have been objectively defined and 
measured. Likewise the extensive development of tests for 
the measurement and prediction of human intelligence, abil- 
ities, and achievement, represents steps toward the objecti- 
fication of human characteristics that have been assumed to 
be “by nature” subjective and unmeasurable. There ap- 
pears to be no reason why all of the above mentioned char- 
acteristics of populations and others should not be system- 
atically recorded and take their place with other demographic 
data which are today regarded as necessary for intelligent s0- 
cial administration or scientific advance. It is only compara- 
tively recently that statistics of illiteracy have made their 
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appearance among demographic data. Still more recently 
have we recognized that for many purposes “I Q”. or “men- 
tal” age distributions are more important than the readily 
available chronological age. Such data, together with sta- 
tistics on the distribution of wealth, migrations, rural-urban 
residence, etc., are already well recognized demographic data 
which we need to record as systematically as the traditional 
population, birth, death, and marriage statistics. The anal- 
ysis of all of these data over periods of time and their cor- 
relation with each other and with other phenomena have 
yielded the most reliable and verifiable generalizations which 
the social sciences have thus far attained. When data of sim- 
ilar extent and reliability become available also for an un- 
limited variety of other characteristics and social behaviors 
of numerous different cultures, generalizations of demonstra- 
ble scientific validity will be more prevalent in the social 
sciences.!2 

The greatly increased registration of relatively standard- 
ized types of social data advocated above does not, of course, 
contemplate the abandonment or reduction of most of the 
special types of research considered through this book. On 
the contrary, a broader and more adequate foundation of 
basic data of the type we have here called demographic would 
greatly implement and stimulate the more academic and 
“purely scientific” research. The burden of collecting first 
hand the necessary basic data to test really important socio- 
logical hypotheses and theories is, as every researcher knows, 
frequently prohibitive. If the great extension of the statisti- 
cal services which we have reviewed in this chapter seems it- 
self to involve prohibitive amounts of time, personnel, and 
expense, three considerations should be remembered : (1) In- 
creasing reliance upon scientific sampling will greatly reduce 
the size of such an undertaking as the decennial census with- 
out impairing its value. (2) The rapid development of sta- 
tistical machinery easily makes possible the classification and 
more refined analysis of the amounts of data which would be 


12 A beginning toward a more comprehensive and systematic accumulation 
of data - mie different cultures is contemplated by the qatar 
Survey under way at Yale University. See G. P. Murdock, e 
Cultural Survey,” Amer. Sociol, Rev., 5:961-70, 1940. 
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quite unmanageable without such mechanical aids.1* (3) As 
for the expense of a considerable extension and improvement 
of registration and gathering of standard social data, it is 
perhaps sufficient to point out that the census of 1940 in the 
United States cost much less than our latest battleship. 

Finally, there will be those who are depressed by the “in- 
humanly dispassionate sweep of the scientific quest” and who 
will be impatient of the laborious and mundane methods 
proposed in this book. To those who feel this way we can 
only say with Veblen : 


One may approve or one may deprecate the fact that this 
opi ue, materialistic interpretation of things pervades modern 
hinking. That is a question of taste, about which there is no 
disputing. The prevalence of this matter-of-fact inquiry is & 
feature of modern culture, and the attitude which critics take 
toward this phenomenon is chiefly significant as indicating how 
far their own habit of mind coincides with the enlightened com- 
mon-sense of civilized mankind. It shows to what degree they 
are abreast of the advance of culture. Those in whom the savage 
predilection or the barbarian tradition is stronger than their 
abituation to civilized life will find that this dominant factor 
of modern life is erse, if not calamitous ; those whose habits 
of thought have been fully shaped by the machine process ani 
scientific inquiry are likely to find it good. The modern western 
culture, with its core of matter-of-fact knowledge may be better 
or worse than some other cultural scheme, such as the classic 
Greek, the medieval Christian, the Hindu, or the Pueblo Indian. 
Seen in certain lights, tested by certain standards, it is doubtless 
better ; by other standards, worse. But the fact remains that the 
py cultural scheme in its maturest growth, is of that com 
Pr hich: its characteristic force lies in this matter-of-fact insight. 
ts highest discipline and its maturest aspirations are these.* 


H. SUGGESTIONS FOR FURTHER STUDY 
1. For a brief history of the development of social statis 


18“It is estimated that the following machines will be used for decennial 


census work : manual key punch hi duplicating key pun 
machines, 12 Aoba Lea OCEN gh Mita: gamete itiplying 
punch machine, 11 alphabetic © punch machines, 1 automatic mulUPp 


verifying machines, 92 šorti 
69 accounting machines, a 
Secretary of Commerce, 1940, p. 47. ; 

14 Thorstein Veblen, T Place of Science in Modern Civilization, (viking 
Press, New York, 1932), pp. 29, 30. 
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tics, see George A. Lundberg, “Statistics in Modern Social 
Thought,” (Chap. 6 in Contemporary Social Theory, edited by 
H. E. Barnes, H. and F. B. Becker, Appleton-Century, 1940). 
This chapter also contains references to the principal sources. 
See also “Proceedings of the Centenary Celebration, 1839- 
1939,” Four. Amer. Statis. Asso., 35:213-308, 1940. 

2. The best bibliographical reference and general digest 
of international population statistics and studies is the Popula- 
tion Index, published quarterly by the School of Public 
Affairs, Princeton University, and the Population Association 
of America, Inc. 

3. For a report on attempts to coordinate official statistics 
in New York State, see R. L. Gillett, “Progress in State Sta- 
tistics,” Jour. Amer. Statis. Asso., 35:335-40, 1940. 

4. An excellent summary of the extensive statistical ac- 
tivities of the United States Government will be found in 
“The Statistical Services and Activities of the United States,” 
in The Inter-American Statistical Compendium, (Central 
Statistical Board, Washington, D. C., May, 1940). See also 
Statistical Activities of the American Nations, edited by 
Elizabeth Phelps under the direction of the Inter-American 
Statistical Institute, Census Bureau Building, Washington, 
D. C., 1941. For research suggestions, see P. M. Hauser, 
“Research Possibilities in the United States Census,” Amer. 
Sociol. Rev., 6:463-70, 1941- 4 

5. An excellent analysis of the type of cultural resistance 
which the scientific'approach to human affairs is likely to en- 
counter will be found in M. Goran, “The Literati Revolt 
Against Science,” Phil. of Science, 7: 379-84, 1940. 
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